Gene Cthe_1871 details

Gene Information

Locus tag: Cthe_1871
Symbol:
ID: 4809202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2220405
End bp: 2222282
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 37%
IMG OID: 640107290
Product: Tn7-like transposition protein D
Protein accession: YP_001038285
Protein GI: 125974375
COG category:
COG ID:
TIGRFAM ID:
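
The length fields above are consistent with the coordinates. As a quick check, the sketch below recomputes the gene length from the start and end positions and derives the protein length from it. It assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The record states neither assumption explicitly, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2220405
end_bp = 2222282
protein_length_aa = 625

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1878
print(gene_length_bp % 3)       # 0
print(expected_protein_length)  # 625
```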


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCATT TTTTCCCAAC CCCATATCCT GATGAAATCC TTTATAGTGT ATTGGCACGC 
TATAGTGTCC GCTGTGGGAT TACAAGTTAT CAAACTATCA TGGAGAGCAT ATTTGGAAAG
TGTAGTTCCA GGGCTGTAAT GGAAATGCCT TTTAATTTGA ATTCGCTGGT ATCTAACCTA
CCTGTGAATT GTCCTTATAC TGCTGATGAT TTGATTTATA ATCATACTCT GTACCCATTC
TTTACTGCAT TTCTTCCAAA AGAACGGGCA GAAGAAGTAA AACAATTAAT GCTGTCTGAA
GGTGGAAGCA AAATTTATGG TAAAGCAGGT ATTATCGGCA GTAGGATTCC ACTAAACCAA
TATTTAAGAT TTTGTCCCAA ATGCTTCGAA GAAGAGCAGA AATTGTATGG TGAAGGATAT
TGGCACAGGT TGCATCAGAT ACCCTTTGTA ATGGTGTGTC CTATACATAA AGCGATTCTT
CATAATAGTA CTGTTTTAGT GCGGGGTCAT AATCCACAAG CCTATGTACC TGCTGACGCT
GATAACTGCA TTAATAATGA ATTACTTTAT TTCAAACCTG AAACGATAGA GAAGTTTGTA
CTCATTGCCC AAGATGCAAA AGTATTGCTT GAAAACCAGT ATCCCAATAA GCAGTTTAAA
TGGTTTGCTA AACAGTATTT AGAAAGGTTT AAGGAATTAG GGTATGCCAA TATTAATGGT
AAAGTGTATT GGGAAGAAGT AATAAAGGAC TTCATAGATT TCTACGGATT GGAGTTCCTA
AATGCTGTTT GTTCTAATAT AGAAGACAAG GAACAGGGAA GATGGCTGAA AGAAATTACA
CATAGTGATG CAAAGTCGGT GTATCCAATT CGGCACCTGA TGCTGGCAAG ATTTTTAGGA
ATAGGAGTAG AAGAACTTTT TATAAAAGAG TTAAATTACA GACCTTTCGG AGAAGGGCCT
TGGGTATGTC TTAATCCTGC TGCTGACCAC TTTCTTGAAC CTGTAATAAA AGACGTAGAA
ATAAAGTATA GGCGTAGAAA TAAGAACACT AATGGTTTTT TCAGATGTCC CTGCGGATAT
GAATATATGG AAACAGTTAC AAGAGAACAA GGAGAAAGGG GTAAGGGGCG GCGCAGATTT
GTCAGGGTTG TAGAGTATGG ACATGTATGG AAGGCTAAGG CTCATGAATT GCATGAAAGT
GGAGTAAGTA TTCAGGAAAT TGCAGTAAGA CTTAATGCAG ATATTAGTAC TGTAAGAAAG
TATATTTCTG AAAAAGGACA GGAAAAAAGC AAAAAGTTAG TAGAACGTAA TAGTGTTGCT
TCTGCGGAGT TTGAAGTAAA AAGACAGTAT CACAGGGAAA AGTGGCTTCA GATAGTCAGG
GAAAATCCAG ACAAGGGCAG ACTTGAATTA CGGAGGCTGG GTAAATATAC CTGTACCTGG
TTATGCAGGA ATGACAGGGA GTGGTGGGAG AAAAACACCC CTGCAAAGAA ATACGTTCAG
GCTTACAGTA ATGTTGATTG GGAAACTAGG GACAAGGAAA TCTTACAGCA TGTTAAGCAA
ACTGTTCAGG AGATTTTAGA AAGTGATGAA AAACCTCAAC GAATTAGCCT GCGGTTGATT
AAAACTAAGT CAGGACTTAA AAGTTTTGAC CTTCAATTGG ATAAATTACC GTTGACTAAA
TCATTCATCA ATTCTGTTAT AGAGAACCCT ATGGATTTGC ACAAAAGGCG TATACAATGG
GCTATTGAAA AACTTAATAA AGAAGGAAAA GCATTAACTA TTTCTAATAT TACTGTCATG
ACAGGTGTTG GTAATAAATA TAGGAAACTA GTTATAGAGG AGATTAAAAA GGCGTTAGAA
GAGTTAGGTG AGAGGTAG
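
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before computing a length or GC fraction. A minimal sketch of that step follows. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tooling, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGATGCATT TTTTCCCAAC CCCATATCCT GATGAAATCC TTTATAGTGT ATTGGCACGC"
cleaned = clean_sequence(first_line)

print(len(cleaned))
print(round(gc_fraction(cleaned) * 100), "% GC")
```

Passing the full gene sequence through the same two functions gives values to compare against the Gene Length and GC content fields of the record.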
 
Protein sequence
MMHFFPTPYP DEILYSVLAR YSVRCGITSY QTIMESIFGK CSSRAVMEMP FNLNSLVSNL 
PVNCPYTADD LIYNHTLYPF FTAFLPKERA EEVKQLMLSE GGSKIYGKAG IIGSRIPLNQ
YLRFCPKCFE EEQKLYGEGY WHRLHQIPFV MVCPIHKAIL HNSTVLVRGH NPQAYVPADA
DNCINNELLY FKPETIEKFV LIAQDAKVLL ENQYPNKQFK WFAKQYLERF KELGYANING
KVYWEEVIKD FIDFYGLEFL NAVCSNIEDK EQGRWLKEIT HSDAKSVYPI RHLMLARFLG
IGVEELFIKE LNYRPFGEGP WVCLNPAADH FLEPVIKDVE IKYRRRNKNT NGFFRCPCGY
EYMETVTREQ GERGKGRRRF VRVVEYGHVW KAKAHELHES GVSIQEIAVR LNADISTVRK
YISEKGQEKS KKLVERNSVA SAEFEVKRQY HREKWLQIVR ENPDKGRLEL RRLGKYTCTW
LCRNDREWWE KNTPAKKYVQ AYSNVDWETR DKEILQHVKQ TVQEILESDE KPQRISLRLI
KTKSGLKSFD LQLDKLPLTK SFINSVIENP MDLHKRRIQW AIEKLNKEGK ALTISNITVM
TGVGNKYRKL VIEEIKKALE ELGER
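
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal version under stated assumptions: the listed gene sequence is the coding strand, it contains only A, C, G and T, and the amino acid assignments of translation table 11 match the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The function name `translate` and the sample fragments are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    sequence = "".join(dna_text.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 18 bases of the gene sequence above.
print(translate("ATGATGCATT TTTTCCCAAC CCCATATCCT"))  # MMHFFPTPYP
print(translate("GAGTTAGGTG AGAGGTAG"))               # ELGER
```

The two fragments translate to the first ten and last five residues of the protein sequence shown above.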