Gene Cthe_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1890 
Symbol 
ID4809221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2244759 
End bp2246891 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content35% 
IMG OID640107309 
Productcellulosome enzyme, dockerin type I 
Protein accessionYP_001038304 
Protein GI125974394 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA AAGGACTACT ATTACTACTA ACAGTTATTG CAGCAACCAT TGTTGTTAGC 
ATGATGTCTG CCGGTGCTAC TACATTATAC GGTGACTTAA ATGCAGATGG TTCAATCAAC
TCAACCGATT TAATGATAAT GAAGAGAGTA CTGCTCAAGC AAAGAACTCT TGATGACATT
ACTCCCGCTG ATTTGAATGG TGACGGTAAA GTAACCTCAA CAGATTATTC GTTGATGAAA
AGATACTTAC TCAAGGAAAT AGACAAATTT CCGGTTGAGG ATATAGAACC GACTCCTACA
CTGGAGGTTA GCCCAACTCC TACGGAAACC AGTGAAGAGG TATTTGCTTT TAAAATTAAA
CTATTTTCGG ATGGCGATAC ATACAGGTTT CCTATTCAAG AGATATCAGA GAATAATAAT
ATTGTTGTTG ACTGGGGTGA TGGTACAACA AGTACTATAA CTGATTATTC AACATTGAGG
CATAAGTATG AAAAAGCAGG TGTATATACA ATAAAAGTAC TTTGGTTTGA TCACATACCA
ATTCGGTTTA CAGGAGATAA GTATGTGATT GAAATACTTA CACCCCTTCC AGATATCGGA
TTAACTGACT TTAGCTCTTT TTTTAAAAAC TGTAGTAACC TAGAAAGAAT TCCAGACAGA
TTATTTTCAA ACAATATTAA TGCAACAGAC TTTAATTTCT GTTTTAGCGG CTGTACTAGC
TTGACAGAAA TACCTGAGAG TTTGTTTGCA GGCAATGTTA ATGCAACTAC CTTTGTTCGG
TGTTTTTACC GTTGTAGCAA CTTAATAAAA GTTCCGGAAG GGTTGTTTGA AAATAATGTT
AATGCGACTA ATTTTTTGGG CTGTTTCGAT GAATGTAGTA GCCTGAAGGA AATTCCAGAA
GGATTATTTT CAAATAATGT TAATGCAGCA AACTTTAGTT GGTGCTTTAG TGAATGTGTT
AGTTTAGCAA AAATTCCTGA AGGATTGTTT AGAAATAATA CTAATGCAAC AGATTTTAGT
TACTGTTTTT ATGGTTGTAC TAGCATAACA AAAATTCCCG GAGGGCTGTT TGAAAATAAT
ATTAATGCGG AAGACTTTGG TAATTGCTTT AGTGGATGCA GTAGCATAAC GGAAATTCCC
GGAGGGCTGT TTGAAAATAA TATTAATGCG GCAAACTTTG GTAGTTGCTT TAGTGGATGT
AGTAGCATAA CGGAAATTCC AGAAGGGCTA TTTGAAAATA ATATTAATGC GGAAGACTTT
AGAGGTTGCT TTAGTGGATG CAGTAGCATA ATGGAAATTC CAGAAGGGCT ATTTAAAAAT
AATATTAATG CGGAAGACTT TAGAGGTTGC TTTAGTGGAT GCAGTAGCAT AACGGAAATT
CCCGGAGGGC TGTTTGAAAA TAATATTAAT GCGGAAGACT TTGGAGGTTG CTTTAGTGGA
TGCAGTAGCA TAACGGAAAT TCCCGGAGGG CTGTTTGAAA ATAATATTAA TGCATCAGAC
TTTAGTAGTT GTTTTAGTGG ATGCAGTAGC ATAACGGAAA TTCCTGGGGG TTTGTTTAGA
AATAATATTA ATACAACAAG ATTTATGGAG TGCTTTAAAG GATGTAGCAG CGTAACAGAA
ATCCCTGAAG AGCTATTTGC CAATAATGTT GATACAGCTA TCTTTATAGG TTGTTTTAGT
GAATGCATCA GTTTGAGAAA AATTCCAGAA GGACTGTTTA AAAATAATAT TAACGTAATA
AGCTTTATGG AGTGCTTTAA AGGATGTAGT AACCTAACAG AAATCCCTGA AGGGCTATTT
GTAAATAATA CTAATGCAAC AGACTTTCAA GGTTGTTTTT ATGGGTGCAG TAGTTTGACA
GAAATTCCTG CGAGATTATT TACGAATAAT GTTAATGTAA CTAATTTTAG AGAGTGTTTT
AGGGATTGTA CGAGCTTAAT AGAAATTCCA GAGAGTCTTT TTGATAGCAA TGTTAATGTC
ACCAATTTTT ATAGATGTTT TTATGGGTGC AAAAACTTAA CAGGTGTAGC ACCTGCTTTA
TGGCTGCGTA CAAATGTTAA AGAATTTTCG GGTTGCTTTG GAAGCTGTAC TAAACTGTCC
AACTATAATG ATATTCCAAA AGGTTGGAAA TAA
 
Protein sequence
MRKKGLLLLL TVIAATIVVS MMSAGATTLY GDLNADGSIN STDLMIMKRV LLKQRTLDDI 
TPADLNGDGK VTSTDYSLMK RYLLKEIDKF PVEDIEPTPT LEVSPTPTET SEEVFAFKIK
LFSDGDTYRF PIQEISENNN IVVDWGDGTT STITDYSTLR HKYEKAGVYT IKVLWFDHIP
IRFTGDKYVI EILTPLPDIG LTDFSSFFKN CSNLERIPDR LFSNNINATD FNFCFSGCTS
LTEIPESLFA GNVNATTFVR CFYRCSNLIK VPEGLFENNV NATNFLGCFD ECSSLKEIPE
GLFSNNVNAA NFSWCFSECV SLAKIPEGLF RNNTNATDFS YCFYGCTSIT KIPGGLFENN
INAEDFGNCF SGCSSITEIP GGLFENNINA ANFGSCFSGC SSITEIPEGL FENNINAEDF
RGCFSGCSSI MEIPEGLFKN NINAEDFRGC FSGCSSITEI PGGLFENNIN AEDFGGCFSG
CSSITEIPGG LFENNINASD FSSCFSGCSS ITEIPGGLFR NNINTTRFME CFKGCSSVTE
IPEELFANNV DTAIFIGCFS ECISLRKIPE GLFKNNINVI SFMECFKGCS NLTEIPEGLF
VNNTNATDFQ GCFYGCSSLT EIPARLFTNN VNVTNFRECF RDCTSLIEIP ESLFDSNVNV
TNFYRCFYGC KNLTGVAPAL WLRTNVKEFS GCFGSCTKLS NYNDIPKGWK