Gene Athe_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0203 
Symbol 
ID7407194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp250512 
End bp252398 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content40% 
IMG OID643714604 
ProductABC transporter related 
Protein accessionYP_002572127 
Protein GI222528245 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000017604 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAA GACAGACAAC ATCGTACAGA CCTCATATAC AAAGACCGAG AAGACGTGGG 
CCTGGTGGAC CTATGGGACC TGGGTTTGTG GGTGAAAAGC CCAAGGATTT TAAGACTGCT
ATGAAAAAGC TCATAAGGTA TCTATCTGCT TACAAGGTTT CACTTGTTGC AGTAATTGTT
CTTGCAATGC TATCTGCTGC ATTTTCAATT GCAGGACCTA AAATACTCAG CAAAGCAATA
ACAAAGATAT TCGAAGGCAT CATGAATAGA ATAACAGGCA CAGGAAACGG CATTGACTTT
GAGTATGTTG GTAAAATCGT TTTAATTTTG CTGGGGCTGT ATATTGTAAG TGCTCTTTTT
GGCTACATTC AGGGCTGGAT AATGTCAGGC ATTTCGATGA AGTTAACGTA CAGGCTCAGA
AAAGAGATCT CACAAAAGAT TAACAGGCTT CCTTTGAAGT ACTTTGAGGG CACAAACCAG
GGTGAGATAC TGTCAAGAAT CACAAATGAT GTTGACACAC TCACACAGAC TTTAAATCAG
AGCCTAACAC AGATAATAAC CTCAACAACC ATGGTTATCG GCGCACTTGT TATGATGCTC
AGCATAAATG TCTTGATGAC AGTTGTTGCA CTGCTTATAA TTCCTCTTTC TTTTTCGGTT
GTTGCGTTCA TAATTGGGAA GTCACAAAAG TTTTTCATGC AGCAGCAAGA ATATTTAGGG
CATGTGAATG GTCATGTTGA AGAGGTTTAC GGTGGTCACA TTGTTATCAA GGCTTTCAAT
GCTGAAAAAA AGAGTATAGA AAAGTTCAAT AATCTTAACA ACAAGTTATA TGAAGCTGCA
TGGAAATCAC AGTTTTTGAC AGGCGTCATG ATGCCGCTTA TGAACATCAT AGGGAATCTT
GGATACGTTG TTGTGACTGT CATGGGCAGC TATCTTACAA TAAAAGGAGC AATTGAGGTT
GGCGACATTC AGGCGTTTGT CCAGTATATA AGGTCGTTCA CACAGCCAAT TGCCCAAATT
GCTAACATAT CAAACATCCT GCAGCAGACA GCTGCTTGTA GCGAAAGGGT GTTTGAGTTT
TTAGAAGAAG AGGAAGAAGT GCCAGATACA CCAAATCCGG AGATTAAGCT TGACAGCATA
AAAGGAGATG TAGAGTTTAG AAACGTCAAG TTTGGCTACA GGCCAGACAA AGTTGTTATA
AAGAACTTTT CAGCAAAAAT CAAAGCTGGG CAGAAGATTG CAATTGTTGG TCCAACAGGT
GCGGGTAAAA CTACCATTGT AAAACTTTTA ATGAGGTATT ACGATGTGAA TGATGGTGCG
ATTTTGATAG ATGGGCATGA TATAAGGGAG TTCAAACGTG AAGATTTGAG ATCGCTTTTT
GGAATGGTAT TGCAGGACAC ATGGCTGTAC AATGGCACAA TCAAAGACAA CATCCGCTAT
GGCAAGCCAG ATGCAACAGA TGAAGAAGTA ATAAGAGCTG CAAAGCTTGC ACACGTTGAC
CATTTTATAA GGACACTACC TCAAGGATAT GACACCGTTT TGAATGAGGA GACAACAAAT
ATTTCTCAAG GTCAAAAACA GCTTTTGACA ATTGCAAGGG CAATCCTCAA AGACCCCAAA
ATTTTGATAC TTGACGAGGC AACAAGCTCT GTTGATACTT TGACAGAGAT CCAGATACAA
AAGGCAATGG ACAATCTCAT GAAAGGAAGA ACATCGTTTA TAATAGCCCA CAGGCTTTCA
ACAATAAGAA ACGCAGACCT CATTTTGGTC ATGGACCATG GCGACATTGT TGAGCAAGGT
ACACACAAAG AGCTTTTGCA AAAAGGCGGA TTTTATGCTC AGCTTTACTA CAGCCAGTTC
GAAAAGGAAG AAGAGCTTGC AGGATAA
 
Protein sequence
MSERQTTSYR PHIQRPRRRG PGGPMGPGFV GEKPKDFKTA MKKLIRYLSA YKVSLVAVIV 
LAMLSAAFSI AGPKILSKAI TKIFEGIMNR ITGTGNGIDF EYVGKIVLIL LGLYIVSALF
GYIQGWIMSG ISMKLTYRLR KEISQKINRL PLKYFEGTNQ GEILSRITND VDTLTQTLNQ
SLTQIITSTT MVIGALVMML SINVLMTVVA LLIIPLSFSV VAFIIGKSQK FFMQQQEYLG
HVNGHVEEVY GGHIVIKAFN AEKKSIEKFN NLNNKLYEAA WKSQFLTGVM MPLMNIIGNL
GYVVVTVMGS YLTIKGAIEV GDIQAFVQYI RSFTQPIAQI ANISNILQQT AACSERVFEF
LEEEEEVPDT PNPEIKLDSI KGDVEFRNVK FGYRPDKVVI KNFSAKIKAG QKIAIVGPTG
AGKTTIVKLL MRYYDVNDGA ILIDGHDIRE FKREDLRSLF GMVLQDTWLY NGTIKDNIRY
GKPDATDEEV IRAAKLAHVD HFIRTLPQGY DTVLNEETTN ISQGQKQLLT IARAILKDPK
ILILDEATSS VDTLTEIQIQ KAMDNLMKGR TSFIIAHRLS TIRNADLILV MDHGDIVEQG
THKELLQKGG FYAQLYYSQF EKEEELAG