Gene Cthe_2825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2825 
Symbol 
ID4809662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3340946 
End bp3342793 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content41% 
IMG OID640108245 
Producthypothetical protein 
Protein accessionYP_001039217 
Protein GI125975307 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000521887 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTA TAACGTTATA TGCCGGAAAA ATCAACCAAA TGCCCGGATT GATAAATGAA 
GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG
AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC
CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTT
ATATCGGAAG TAATACGTAT CGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA
AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA
ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCC
AAGATAGTGG CTGCCGCAGT AGTTATTACC GGGTTAGGGA TAGCGGCGGC ATTGACAGGA
GGGGTATTGG GAGTCATACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA
GGAGGAGCGG TTGGAGGAAT AGCCGCGGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT
GCGGACGGGG CATTAAGCGG AGCGATTTCC GGAGCGGTAA CGGGAGCCGC ATGTGCCGGG
CTGGGTGCTT TGGGAGCAGC GGCAGGAAAA GGAATCCAAT GTATGAGCAC AGTGGGAAAA
GCGATAAATG TTACATCAAA GGTTACGGCA GCACTCTCGT TTGGTATGGA TGGATTTGAC
ATGCTGGCAA TGGGAGTATC ATTGTTTGAT CCATCCAACG CATTGGTTGA ATTTAACCGG
AAGCTGCATT CCAATGCACT TTATAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT
TTCAGTGCCG GGGCGGCATC GACAATGAAG TGCTTTGTTG CAGGTACAAT GATATTGACA
GCGGCAGGTT TGGTTGCGAT AGAGAATATC AAGGCAGGAG ACAAGGTAAT TGCGACGAAT
CCGGAGACTT TTGAAGTAGC GGAAAAGACG GTGCTTGAGA CATATGTGAG AGAGACAACG
GAGCTTTTGC ATTTGACAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT
TATGTAAAAG ATGTGGGTTT TGTTGAAGCG GGAAAACTGC AAGTAGGAGA TAAGTTGGTT
GATTCAAAAG GCAATGTTTT GGTGGTGGAA GAGAAAAAGC TTGAGATAAC AGATGAACCT
GTTAAGGTTT ATAACTTCAA AGTGGATGAT TTTCATACTT ATCATGTTGG GAAAAAAGGG
ATATTGGTAC ATAATGCAGA CTATAACCCC AAAATGGGAT TTGATGATTT GGACCTTGAG
AAAGCTACGA ACAAACAAAA AGGCAATTAT GGAGAGTATC TGGCAGATGA TAATCTTATT
AATAATCCAA AATTGAAAGA AGCAGGGTAT GATTTGGAGC GGATAGGAGG TAAGGTTCCG
ACCTCACCGG ATGATAAAAT TACAAAAGGG ATAGATGGGA TATATATAAA TAAGAATCCT
GACTCAAATG TTAAATATGT AATTGATGAG GCGAAATTTG GAAAAGCGGG ACTTAGTACA
AAGACAAGAG ATGGAAAACA AATGTCGGAT TCTTGGCTGA TAGGTGATAA AACAGGTAAT
GATAGAATTT TAGAAGCAGT GAATAATGAT AAACAATTAG CAGCTGGTAT ACTCGATGCA
TTACAAAACA ACCAAGTAGA AAGAGTGTTG TCAAAAGTGG ATGCAAACGG AAATGTAACG
ACATATAGAC TGGATAGTGA TGGTAATATA ATTGGAGTTT GGCCATAA
 
Protein sequence
MATITLYAGK INQMPGLINE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS 
QTQDRKIDSL EKFCSESEKF ISEVIRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK
IKDGLKSVAE WCKENWKSIA KIVAAAVVIT GLGIAAALTG GVLGVILAGA FWGALAGGLI
GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGAAAGK GIQCMSTVGK
AINVTSKVTA ALSFGMDGFD MLAMGVSLFD PSNALVEFNR KLHSNALYNG FQIAVNALAV
FSAGAASTMK CFVAGTMILT AAGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT
ELLHLTINGE VIKTTFEHPF YVKDVGFVEA GKLQVGDKLV DSKGNVLVVE EKKLEITDEP
VKVYNFKVDD FHTYHVGKKG ILVHNADYNP KMGFDDLDLE KATNKQKGNY GEYLADDNLI
NNPKLKEAGY DLERIGGKVP TSPDDKITKG IDGIYINKNP DSNVKYVIDE AKFGKAGLST
KTRDGKQMSD SWLIGDKTGN DRILEAVNND KQLAAGILDA LQNNQVERVL SKVDANGNVT
TYRLDSDGNI IGVWP