Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4187 |
Symbol | atpA |
ID | 6272020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3910447 |
End bp | 3911988 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641728008 |
Product | F0F1 ATP synthase subunit alpha |
Protein accession | YP_001882429 |
Protein GI | 187731460 |
COG category | [C] Energy production and conversion |
COG ID | [COG0056] F0F1-type ATP synthase, alpha subunit |
TIGRFAM ID | [TIGR00962] proton translocating ATP synthase, F1 alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00431542 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTGA ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT GTTGTGAGTG AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC ATTCACGGCC TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC GCTATCGCAC TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT GACCTTGCCG AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC CGTGGCCTGC TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGACGG TAAAGGTCCG CTGGATCACG ATGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG TCCGTAGATC AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT CGTGGTCAGC GTGAATTGAT CATCGGTGAC CGTCAGACAG GTAAAACCGC ACTGGCTATC GATGCCATCA TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG AAAGCGTCCA CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC ACCATCGTTG TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACCGTAT GCCGGTTGCG CAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC GATGACCTGT CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG CCAGGACGTG AAGCATTCCC GGGCGACGTT TTCTACCTCC ACTCTCGTCT GCTGGAGCGT GCTGCACGTG TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA AACGGTTCTC TGACCGCACT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC GTTCCGACCA ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC AACGCCGGTA TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA GCACAGACCA AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT CGTGAACTGG CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG CTTGACCACG GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC GTTGCGCAGC AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA CTGTCGAAAA TTGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT CCGTTGATGC AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAACTG AAAGGCATCC TCGATTCCTT CAAAGCAACC CAATCCTGGT AA
|
Protein sequence | MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLAPY AGCAMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER AARVNAEYVE AFTKGEVKGK NGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW
|
| |