Gene EcolC_4260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4260 
Symbol 
ID6068003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4710131 
End bp4711672 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID641603697 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001727183 
Protein GI170022229 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00377542 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0472996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTGA ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT 
GTTGTGAGTG AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC
ATTCACGGCC TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC
GCTATCGCAC TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT
GACCTTGCCG AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC
CGTGGCCTGC TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGACGG TAAAGGTCCG
CTGGATCACG ACGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG
TCTGTAGATC AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT
CGTGGTCAGC GTGAATTGAT CATCGGTGAC CGTCAGACAG GTAAAACCGC ACTGGCTATC
GATGCCATCA TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG
AAAGCGTCCA CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC
ACCATCGTTG TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACCGTAT
GCCGGTTGCG CAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC
GATGACCTGT CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG
CCAGGACGTG AAGCATTCCC GGGCGACGTT TTCTACCTCC ACTCTCGTCT GCTGGAGCGT
GCTGCACGTG TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA
ACCGGTTCTC TGACCGCACT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC
GTTCCGACCA ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC
AACGCCGGTA TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA
GCACAGACCA AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT
CGTGAACTGG CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG
CTTGACCACG GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC
GTTGCGCAGC AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA
CTGTCGAAAA TTGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT
CCGTTGATGC AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAGCTG
AAAGGCATCC TCGATTCCTT CAAAGCAACC CAATCCTGGT AA
 
Protein sequence
MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY 
AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLAPY
AGCAMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER
AARVNAEYVE AFTKGEVKGK TGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF
NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ
LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA
PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW