Gene Sare_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0501 
Symbol 
ID5703306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp570992 
End bp572581 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content72% 
IMG OID641270027 
Productmajor facilitator transporter 
Protein accessionYP_001535421 
Protein GI159036168 
COG category 
COG ID 
TIGRFAM ID[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.406138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00103191 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGTCGT CCCTCCGCTC CGAGCGCTCC TTCCTCGGGC GAACCGTCGG CACGGGCATC 
AAGGCCGTCC GCCTACTGCT GCGTGGCTCA GTCAGCGGCG GTCGTTGGAT GACCCAGTCG
GCGGGACGGG CCCGCGCCCG GGGGGCCGGC CGGGAGATCG GCATGGTCCG TCTGTTCGAC
CTGCACGCGG TCTCCTGCGC TGGCGACACG CTGATCGCAA TCGGCCTGGC CGGAACGATC
TTCTTCAACG TGCCACTCGG TGAGGCACGC AGCAAGGTCG CGCTCTACCT GCTGGTGACC
ATGATCCCGT TCGCCCTGCT CGCCCCCGTG GTCGGTCCGC TACTCGATCA CTTCCGGCAC
GGCCGGAGGT ACGCCCTGGC CACCAGCATG CTCGGGCGGG CGTTCCTGGC CTGGCTGATC
TCCGACTACA TCCACGGCTT CGGCCTGTAC CCGGCGGCCT TCGGGGTGCT GGCGCTCTCC
CGCGCGTACG GGGTGGCCCG GTCGGCCGCC GTGCCCCGGC TGCTCCCGGA AGGGCTCGGG
CTGTCCCAGG TCGGGGCACG GGCCAGCGTG TACGGGACAC TGGCCGGGGC CCTGGTCGCG
CCCATCGGCC TGGCCGCCTT CTGGTTCGGG CCGCAGTGGC CCCTGCGGGT CGCCTCGGTC
ATCTTCCTGG TCGGCATGGT CGTCGCCCTG CGGCTGCCAC CGAAGGCCGA CTCGGAGCCG
CCCGAGCGGG TACCGCGTCC CCTGCGGGCG TTCAGGCGAG GCTCGGGTGA CCGGCCCCTG
GGCCGCGGGC GCCCGCACGG CCGGCTGGTG ATCGCGACTC TGATCGGTGC CGCCACCCTA
CGTGCGGTCT ACGGCTTCCT GCTGCTCTTC CTGGCCTTCG CGATCAAGGC CGGTGACCTG
ACCACCGTCG TCCTCGGCCG AGACCTGAGC GACGAGGCCG CCCTCGGACT GGTCGGCGGG
GCGCTGGCGA TCGGTAGCTT CCTGGCCACC GCGATCGGCA CCCGGCTGCG TATCCACCGG
CCGGCGACGC TCCAGTCCAG CGGCATGGTG ATCATCGCTG GGGTGGCCGT ACTCGCCACA
CTCAAGTTCT CGCTACTGAT GGTCGCCCTG CTCTGCCTGG TCGCCGCGCT GATGAGCGGA
ATCGCCAAGC TCGCAGTGGA CGCCTCGATC CAGGAACGTA TCCCCGAACG ACTGCGCGCC
AGCTCATTCG CCCACTCGGA GACGGTTCTC ATGCTCGCCT TCGTCGCTGG GGGTGGGCTG
GGCCTCGTTC CGTTCGACGG TCAGCTCGGC ATCGCCGTCG CCGCCGGGGT CGCAACGCTC
GCCACAGCCC GGGGCATCGT GGTCGCCGGC CGGCTACGGG CGGAGCGACT GGCCGGCCGA
CCGCTGGCCG ATGACGAGCT CACCGAAGGC GACACGGACC CGGCTGAACC CGCCTCCGGC
GACCCCGCAC CGACCTCACC CGCCCCGACA CCGGAGGCCG AGGGTGGCGA TGACCTGGCA
CCACCGGGTT TCCACATCTA CCGCCCGTCG TCGGCGGTCG GTGGGCCCGG CGGCACCGAC
GAAGAGACCC GCCAGGGGCC CATCCCGTGA
 
Protein sequence
MPSSLRSERS FLGRTVGTGI KAVRLLLRGS VSGGRWMTQS AGRARARGAG REIGMVRLFD 
LHAVSCAGDT LIAIGLAGTI FFNVPLGEAR SKVALYLLVT MIPFALLAPV VGPLLDHFRH
GRRYALATSM LGRAFLAWLI SDYIHGFGLY PAAFGVLALS RAYGVARSAA VPRLLPEGLG
LSQVGARASV YGTLAGALVA PIGLAAFWFG PQWPLRVASV IFLVGMVVAL RLPPKADSEP
PERVPRPLRA FRRGSGDRPL GRGRPHGRLV IATLIGAATL RAVYGFLLLF LAFAIKAGDL
TTVVLGRDLS DEAALGLVGG ALAIGSFLAT AIGTRLRIHR PATLQSSGMV IIAGVAVLAT
LKFSLLMVAL LCLVAALMSG IAKLAVDASI QERIPERLRA SSFAHSETVL MLAFVAGGGL
GLVPFDGQLG IAVAAGVATL ATARGIVVAG RLRAERLAGR PLADDELTEG DTDPAEPASG
DPAPTSPAPT PEAEGGDDLA PPGFHIYRPS SAVGGPGGTD EETRQGPIP