Gene SbBS512_E4187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4187 
SymbolatpA 
ID6272020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3910447 
End bp3911988 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID641728008 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001882429 
Protein GI187731460 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00431542 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTGA ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT 
GTTGTGAGTG AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC
ATTCACGGCC TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC
GCTATCGCAC TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT
GACCTTGCCG AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC
CGTGGCCTGC TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGACGG TAAAGGTCCG
CTGGATCACG ATGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG
TCCGTAGATC AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT
CGTGGTCAGC GTGAATTGAT CATCGGTGAC CGTCAGACAG GTAAAACCGC ACTGGCTATC
GATGCCATCA TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG
AAAGCGTCCA CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC
ACCATCGTTG TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACCGTAT
GCCGGTTGCG CAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC
GATGACCTGT CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG
CCAGGACGTG AAGCATTCCC GGGCGACGTT TTCTACCTCC ACTCTCGTCT GCTGGAGCGT
GCTGCACGTG TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA
AACGGTTCTC TGACCGCACT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC
GTTCCGACCA ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC
AACGCCGGTA TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA
GCACAGACCA AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT
CGTGAACTGG CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG
CTTGACCACG GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC
GTTGCGCAGC AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA
CTGTCGAAAA TTGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT
CCGTTGATGC AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAACTG
AAAGGCATCC TCGATTCCTT CAAAGCAACC CAATCCTGGT AA
 
Protein sequence
MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY 
AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLAPY
AGCAMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER
AARVNAEYVE AFTKGEVKGK NGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF
NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ
LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA
PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW