Gene Cthe_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1501 
Symbol 
ID4810539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1824403 
End bp1826139 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content40% 
IMG OID640106921 
ProductABC transporter, transmembrane region 
Protein accessionYP_001037922 
Protein GI125974012 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.014393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCGGT ATTGGAAGTA TGTAAAACCG TATTTATCTG CATTTATCAT AGGGCCGATT 
TTTATGATTG TGGAAGTTAT CGGTGAAGTG ATTATGCCTC TGCTACTGAG CAATATCATT
GATAAGGGAG TTATTGGCGG CAGAGGTATT TCATATATAG TAACAATGGG TATAATAATG
ATAGTCACAG CGCTGTTCAT GATGACAGGC GGTGTGGGAG GAGCCTATTT TGCAATTAAA
GCATCCACGG GATTTGCCAA CGATTTGAGA AAAGATTTGT TCAGGCAAAT TCAGAAATTT
TCCTTTAACA ATATCGATCA ATATAATACC GGTTCTTTGA TTACACGGTT AACCAATGAT
ATAACCCAGA TACAAAATAT GATTCAAATG ATGCTGAGAT TGGCCCTGAG GGCTCCCGGT
ATGCTCATTG GTGCTCTTAT AATGGCTTTT GCCATGAATG CCAGACTGGC ATTGGTAATC
CTGTGCATAA TACCTTTGCT GTCCCTGGCA GTCTACTTCA TAATTAAAGT TGCATTTCCT
CTGTTTATAA CCATGCAGAA AAAGCTGGAC GCGCTTAACT CCACCACTCA GGAAAATTTA
ACCAATATCA GGGTGGTAAA ATCATTTGTC AGAGAACAGT ATGAAGAAGA AAAATTCAAA
AAAGCCAACA TAGACTTGAA AGAAAGCACA ATGAAGGCAA TGAAGATTGT TATCTTTACC
ATACCGGCCA TGAGCCTTGC AATGAACATC ACAATTTTGG CAGTGGTTTG GTTTGGCGGA
AAACAGATTA TTGCGGGCAG TATGACCTCC GGTGTCCTTA CCGCATTTGT AAATTATGTT
ATCCAGATAC TCATATCACT TGTAATGGTT TCATTTATTA TTCTAAACAG CTCCAGAACC
CTTGCATCCG TAAGGCGTAT CAATGAGGTG CTTGATACAG ATATCGACCT GACTGACGAG
AATGCCGTGT ACAAGGATAA AACTGTTGAA TACGGCAAAA TCGAATTTAG AGAAGTTTAC
TTTAAATACT ATAAAAACAA TGAAAAATGG GTTTTGGAAA ATATCAACCT CGTGATAAAT
CCGGGTGAGA CGGTCGGTAT TATCGGCTCA ACCGGCAGCG GCAAATCAAG CCTTGTGCAG
CTGATTCCAA GACTGTATGA TGTGGATTTT GGCGAAGTGC TGGTGGACGA CGTCAATGTT
AAAGATTATT CGCTTAAAAA TCTTAGAAAC GGCGTTGGCA TTGTACTGCA AAAGAATGTC
CTGTTTTCCG GTACTATAAT GGAAAACCTC AAATGGGGAG ACGAAAATGC CGGTGACGAT
GAGGTGTACA AGTTTGCAGA AAGTGCTCAG GCACATGGTT TTATCACTTC CTTTGAAAAG
GGATATGATA CCGAGCTTGG ACAGGGCGGT GTCAATGTAT CCGGAGGACA AAAGCAAAGA
TTGTGCATTG CCAGGGCATT GTTGAAAAAG CCCAAAATTT TAATACTTGA TGACAGTACC
AGCGCAGTTG ATACTGCGAC TGAGGCAAAA ATCAGAGAAA GTTTCAGAAA TGAGTTGAAA
GGTACAACGA AAATTATCAT TGCGCAAAGA ATCTCTTCGG TAATTGACGC GGACAAAATA
ATTGTACTTG ATGACGGAAA AATTGTCGGC ATAGGTAATC ACGAAGAATT AATGAAGAAT
TGCGAAACAT ATTCAGAGAT TTATTACTCA CAGATGGATA AGAAGGTGAC TGCATAA
 
Protein sequence
MRRYWKYVKP YLSAFIIGPI FMIVEVIGEV IMPLLLSNII DKGVIGGRGI SYIVTMGIIM 
IVTALFMMTG GVGGAYFAIK ASTGFANDLR KDLFRQIQKF SFNNIDQYNT GSLITRLTND
ITQIQNMIQM MLRLALRAPG MLIGALIMAF AMNARLALVI LCIIPLLSLA VYFIIKVAFP
LFITMQKKLD ALNSTTQENL TNIRVVKSFV REQYEEEKFK KANIDLKEST MKAMKIVIFT
IPAMSLAMNI TILAVVWFGG KQIIAGSMTS GVLTAFVNYV IQILISLVMV SFIILNSSRT
LASVRRINEV LDTDIDLTDE NAVYKDKTVE YGKIEFREVY FKYYKNNEKW VLENINLVIN
PGETVGIIGS TGSGKSSLVQ LIPRLYDVDF GEVLVDDVNV KDYSLKNLRN GVGIVLQKNV
LFSGTIMENL KWGDENAGDD EVYKFAESAQ AHGFITSFEK GYDTELGQGG VNVSGGQKQR
LCIARALLKK PKILILDDST SAVDTATEAK IRESFRNELK GTTKIIIAQR ISSVIDADKI
IVLDDGKIVG IGNHEELMKN CETYSEIYYS QMDKKVTA