Gene EcSMS35_4102 details

Gene Information

Locus tag: EcSMS35_4102
Symbol: atpA
ID: 6143530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4198115
End bp: 4199656
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 53%
IMG OID: 641618926
Product: F0F1 ATP synthase subunit alpha
Protein accession: YP_001746064
Protein GI: 170682498
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
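
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 4198115
end_bp = 4199656
reported_gene_length = 1542      # bp
reported_protein_length = 513    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons hold: 4199656 - 4198115 + 1 = 1542 bp, and 1542 / 3 - 1 = 513 aa.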


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.052842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.176896
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTGA ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT 
GTTGTGAGTG AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC
ATTCACGGCC TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC
GCTATCGCAC TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT
GACCTTGCCG AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC
CGTGGCCTGC TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGATGG TAAAGGTCCG
CTGGATCACG ACGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG
TCCGTAGATC AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT
CGTGGTCAGC GTGAATTGAT CATCGGTGAC CGTCAGACCG GTAAAACCGC ACTGGCTATC
GATGCCATCA TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG
AAAGCGTCCA CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC
ACCATCGTTG TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACCGTAC
GCTGGTTGCG CAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC
GATGACCTGT CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG
CCAGGACGTG AAGCATTCCC TGGCGACGTA TTCTACCTCC ACTCTCGTCT GCTGGAGCGT
GCTGCGCGTG TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA
ACCGGTTCTC TGACTGCGCT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC
GTTCCGACCA ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC
AACGCCGGTA TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA
GCACAGACCA AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT
CGTGAACTGG CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG
CTTGACCACG GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC
GTAGCGCAGC AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA
CTGTCGAAAA TCGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT
CCGTTGATGC AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAGCTG
AAAGGCATCC TCGATTCCTT CAAAGCAACC CAATCCTGGT AA
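
The gene sequence is listed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation such as GC content. The following is a rough sketch of how the reported GC content could be recomputed; the helper names are hypothetical, and only the first line of the sequence is included here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as listed above (60 bases).
first_line = (
    "ATGCAACTGA ATTCCACCGA AATCAGCGAA "
    "CTGATCAAGC AGCGCATTGC TCAGTTCAAT"
)

print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

To compare against the GC content reported in the Gene Information table, pass the full 1542-base gene sequence to `gc_percent` instead of the single line used here.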
 
Protein sequence
MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY 
AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLAPY
AGCAMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER
AARVNAEYVE AFTKGEVKGK TGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF
NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ
LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA
PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW
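
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The function name `translate` and the two test fragments are choices made for this example; the fragments are the first 60 and last 42 bases of the gene sequence listed above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(nt.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 and last 42 bases of the gene sequence.
gene_start = (
    "ATGCAACTGA ATTCCACCGA AATCAGCGAA "
    "CTGATCAAGC AGCGCATTGC TCAGTTCAAT"
)
gene_end = "AAAGGCATCC TCGATTCCTT CAAAGCAACC CAATCCTGGT AA"

print(translate(gene_start))  # MQLNSTEISELIKQRIAQFN
print(translate(gene_end))    # KGILDSFKATQSW
```

The two outputs correspond to the first 20 and last 13 residues of the protein sequence above; the final TAA codon is a stop and is not translated.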