Gene EcHS_A3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3950 
SymbolatpA 
ID5591043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3945495 
End bp3947036 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID640923057 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001460534 
Protein GI157163216 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.0104398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTGA ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT 
GTTGTGAGTG AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC
ATTCACGGCC TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC
GCTATCGCAC TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT
GACCTTGCCG AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC
CGTGGCCTGC TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGACGG TAAAGGTCCG
CTGGATCACG ACGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG
TCCGTAGATC AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT
CGTGGTCAGC GTGAATTGAT CATCGGTGAC CGTCAGACAG GTAAAACCGC ACTGGCTATC
GATGCCATCA TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG
AAAGCGTCCA CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC
ACCATCGTTG TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACCGTAT
GCCGGTTGCG CAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC
GATGACCTGT CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG
CCAGGACGTG AAGCATTCCC GGGCGACGTT TTCTACCTCC ACTCTCGTCT GCTGGAGCGT
GCTGCACGTG TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA
ACCGGTTCTC TGACCGCACT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC
GTTCCGACCA ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC
AACGCCGGTA TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA
GCACAGACCA AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT
CGTGAACTGG CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG
CTTGACCACG GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC
GTTGCGCAGC AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA
CTGTCGAAAA TTGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT
CCGTTGATGC AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAGCTG
AAAGGCATCC TCGATTCCTT CAAAGCAACC CAATCCTGGT AA
 
Protein sequence
MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY 
AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLAPY
AGCAMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER
AARVNAEYVE AFTKGEVKGK TGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF
NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ
LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA
PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW