Gene Information |
Locus tag | Athe_0203 |
Symbol | |
ID | 7407194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 250512 |
End bp | 252398 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643714604 |
Product | ABC transporter related |
Protein accession | YP_002572127 |
Protein GI | 222528245 |
COG category | [V] Defense mechanisms |
COG ID | [COG1132] ABC-type multidrug transport system, ATPase and permease components |
TIGRFAM ID | |
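The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon (the gene sequence listed in this record ends in TAA). The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 250512
END_BP = 252398
GENE_LENGTH_BP = 1887
PROTEIN_LENGTH_AA = 628


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the span
    includes one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

# Report whether the derived values match the fields in the record.
print("gene length matches:", computed_gene == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 252398 - 250512 + 1 = 1887 bp, and 1887 / 3 - 1 = 628 aa.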
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000017604 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAA GACAGACAAC ATCGTACAGA CCTCATATAC AAAGACCGAG AAGACGTGGG CCTGGTGGAC CTATGGGACC TGGGTTTGTG GGTGAAAAGC CCAAGGATTT TAAGACTGCT ATGAAAAAGC TCATAAGGTA TCTATCTGCT TACAAGGTTT CACTTGTTGC AGTAATTGTT CTTGCAATGC TATCTGCTGC ATTTTCAATT GCAGGACCTA AAATACTCAG CAAAGCAATA ACAAAGATAT TCGAAGGCAT CATGAATAGA ATAACAGGCA CAGGAAACGG CATTGACTTT GAGTATGTTG GTAAAATCGT TTTAATTTTG CTGGGGCTGT ATATTGTAAG TGCTCTTTTT GGCTACATTC AGGGCTGGAT AATGTCAGGC ATTTCGATGA AGTTAACGTA CAGGCTCAGA AAAGAGATCT CACAAAAGAT TAACAGGCTT CCTTTGAAGT ACTTTGAGGG CACAAACCAG GGTGAGATAC TGTCAAGAAT CACAAATGAT GTTGACACAC TCACACAGAC TTTAAATCAG AGCCTAACAC AGATAATAAC CTCAACAACC ATGGTTATCG GCGCACTTGT TATGATGCTC AGCATAAATG TCTTGATGAC AGTTGTTGCA CTGCTTATAA TTCCTCTTTC TTTTTCGGTT GTTGCGTTCA TAATTGGGAA GTCACAAAAG TTTTTCATGC AGCAGCAAGA ATATTTAGGG CATGTGAATG GTCATGTTGA AGAGGTTTAC GGTGGTCACA TTGTTATCAA GGCTTTCAAT GCTGAAAAAA AGAGTATAGA AAAGTTCAAT AATCTTAACA ACAAGTTATA TGAAGCTGCA TGGAAATCAC AGTTTTTGAC AGGCGTCATG ATGCCGCTTA TGAACATCAT AGGGAATCTT GGATACGTTG TTGTGACTGT CATGGGCAGC TATCTTACAA TAAAAGGAGC AATTGAGGTT GGCGACATTC AGGCGTTTGT CCAGTATATA AGGTCGTTCA CACAGCCAAT TGCCCAAATT GCTAACATAT CAAACATCCT GCAGCAGACA GCTGCTTGTA GCGAAAGGGT GTTTGAGTTT TTAGAAGAAG AGGAAGAAGT GCCAGATACA CCAAATCCGG AGATTAAGCT TGACAGCATA AAAGGAGATG TAGAGTTTAG AAACGTCAAG TTTGGCTACA GGCCAGACAA AGTTGTTATA AAGAACTTTT CAGCAAAAAT CAAAGCTGGG CAGAAGATTG CAATTGTTGG TCCAACAGGT GCGGGTAAAA CTACCATTGT AAAACTTTTA ATGAGGTATT ACGATGTGAA TGATGGTGCG ATTTTGATAG ATGGGCATGA TATAAGGGAG TTCAAACGTG AAGATTTGAG ATCGCTTTTT GGAATGGTAT TGCAGGACAC ATGGCTGTAC AATGGCACAA TCAAAGACAA CATCCGCTAT GGCAAGCCAG ATGCAACAGA TGAAGAAGTA ATAAGAGCTG CAAAGCTTGC ACACGTTGAC CATTTTATAA GGACACTACC TCAAGGATAT GACACCGTTT TGAATGAGGA GACAACAAAT ATTTCTCAAG GTCAAAAACA GCTTTTGACA ATTGCAAGGG CAATCCTCAA AGACCCCAAA ATTTTGATAC TTGACGAGGC AACAAGCTCT GTTGATACTT TGACAGAGAT CCAGATACAA AAGGCAATGG ACAATCTCAT GAAAGGAAGA ACATCGTTTA TAATAGCCCA CAGGCTTTCA ACAATAAGAA ACGCAGACCT CATTTTGGTC ATGGACCATG GCGACATTGT TGAGCAAGGT 
ACACACAAAG AGCTTTTGCA AAAAGGCGGA TTTTATGCTC AGCTTTACTA CAGCCAGTTC GAAAAGGAAG AAGAGCTTGC AGGATAA
|
Protein sequence | MSERQTTSYR PHIQRPRRRG PGGPMGPGFV GEKPKDFKTA MKKLIRYLSA YKVSLVAVIV LAMLSAAFSI AGPKILSKAI TKIFEGIMNR ITGTGNGIDF EYVGKIVLIL LGLYIVSALF GYIQGWIMSG ISMKLTYRLR KEISQKINRL PLKYFEGTNQ GEILSRITND VDTLTQTLNQ SLTQIITSTT MVIGALVMML SINVLMTVVA LLIIPLSFSV VAFIIGKSQK FFMQQQEYLG HVNGHVEEVY GGHIVIKAFN AEKKSIEKFN NLNNKLYEAA WKSQFLTGVM MPLMNIIGNL GYVVVTVMGS YLTIKGAIEV GDIQAFVQYI RSFTQPIAQI ANISNILQQT AACSERVFEF LEEEEEVPDT PNPEIKLDSI KGDVEFRNVK FGYRPDKVVI KNFSAKIKAG QKIAIVGPTG AGKTTIVKLL MRYYDVNDGA ILIDGHDIRE FKREDLRSLF GMVLQDTWLY NGTIKDNIRY GKPDATDEEV IRAAKLAHVD HFIRTLPQGY DTVLNEETTN ISQGQKQLLT IARAILKDPK ILILDEATSS VDTLTEIQIQ KAMDNLMKGR TSFIIAHRLS TIRNADLILV MDHGDIVEQG THKELLQKGG FYAQLYYSQF EKEEELAG
|
| |