Gene Cthe_0621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0621 
Symbol 
ID4808223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp761564 
End bp762592 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content42% 
IMG OID640106035 
Producttranslation initiation factor 2B subunit I 
Protein accessionYP_001037049 
Protein GI125973139 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0182] Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family 
TIGRFAM ID[TIGR00512] S-methyl-5-thioribose-1-phosphate isomerase
[TIGR00524] eIF-2B alpha/beta/delta-related uncharacterized proteins 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0997628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAT TGGAATATTG TAACGGAGTA TTAAAATTAT TGGATCAGAC TTTGCTTCCG 
GGAGAGCAAA AGATTGTAGA ACTGAAAAAC TATATTGAGG TTGCCGATGC AATAAAAAAT
ATGATTGTAA GAGGTGCTCC CGCCATAGGT GTAACTGCCG CCTACGGTGT GGCAATAGCT
TCAAAGGCAA TAAATACTGA TTCCAAAGAA GAATTTTTTG CCGAACTTGC AAAGGTGTGT
GATATAATTA AAAGCACGCG CCCTACAGCC GTAAATCTGT TCTGGGCTGT TGACAGAGTG
TACAGTAGAG CCGTCTCAAA CCGGGACAAG ACCATAGAGG AAATCAAAAA ACTCATTGAA
GAAGAGGCAT ACCTCATGGA AAAAGAAGAC ATTGAATCCA ACAGAAGCAT AGGCCGGTTC
GGCAATGAGT TGATAAAAGA AAATTGGACA ATCCTCACCC ATTGTAACGC AGGCGCTTTG
GCGACTTGCG ACTACGGCAC TGCTTTAGGA GTTATCCGGG CGGCTCATGA GTCAGGAAAA
AACATACAGG TTTTTGCAGA CGAAACAAGG CCTTATCTTC AGGGTGCAAG GCTTACCGCC
TGGGAATTGA TGCAGGACAA TATTCCCGTA ACGCTTATAT GCGACAATAT GGCAGGGCAT
TTCATGAAAG AAGGATTAAT AGATTGTGTA ATAGTCGGAG CTGACAGAAT AGCTTTAAAC
GGAGATACTG CAAACAAAAT CGGAACTTAT TCCCTTGCTG TTTTGGCAAA GGAAAACAAT
ATTCCGTTCT ATGTTGCCGC TCCCACAACA ACCATTGACT TTTCCATAGA AACGGGAGAG
CAAATTCCGA TAGAAGAGCG AAGCCCTGCT GAAATAACTC ATATCAAAGG CATCAGAATT
GCTCCCGAAG GTGTAAAGGT AAGGAATCCG GCCTTTGACG TAACACCAAA CAAATACATT
TCAGCAATTA TAACCGAAAA AGGTATTATC TACCCGCCAT ATGATGAAAA TATAAAAAAA
TACAGGTAA
 
Protein sequence
MKPLEYCNGV LKLLDQTLLP GEQKIVELKN YIEVADAIKN MIVRGAPAIG VTAAYGVAIA 
SKAINTDSKE EFFAELAKVC DIIKSTRPTA VNLFWAVDRV YSRAVSNRDK TIEEIKKLIE
EEAYLMEKED IESNRSIGRF GNELIKENWT ILTHCNAGAL ATCDYGTALG VIRAAHESGK
NIQVFADETR PYLQGARLTA WELMQDNIPV TLICDNMAGH FMKEGLIDCV IVGADRIALN
GDTANKIGTY SLAVLAKENN IPFYVAAPTT TIDFSIETGE QIPIEERSPA EITHIKGIRI
APEGVKVRNP AFDVTPNKYI SAIITEKGII YPPYDENIKK YR