Gene Cthe_1839 details


Gene Information

Locus tag: Cthe_1839
Symbol:
ID: 4809385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2183456
End bp: 2184517
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 44%
IMG OID: 640107253
Product: biotin synthase
Protein accession: YP_001038253
Protein GI: 125974343
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID:
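
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 2183456, End bp 2184517.
length = gene_length_bp(2183456, 2184517)
print(length, protein_length_aa(length))  # prints: 1062 353
```

Under these assumptions the result agrees with the listed gene length of 1062 bp and protein length of 353 aa.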


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0814889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAACA TGACAAATAT GATAAATTTA ATTGACAAAC TTTCAACAAC ACATACTTTG 
TCGTATGATG AAATGTATCA GCTTATTGAG CATAGAAACG AGGAACTTGC CAATTATCTG
TTTGAAAAGG CAAGGCAGGT GCGTATCCTC TACTACGGCC ACGATGTCTA TATGCGCGGT
CTCATTGAAT TCACCAATTA CTGTCGAAAT GACTGCTACT ATTGCGGAAT AAGGAAAAGC
AACTGCAATG CCGAAAGATA CCGTCTTACA AAAGAGCAAA TACTTGAATG CTGTGACGTG
GGATATGAGC TGGGTTTTCG CACCTTCGTG CTTCAAGGGG GCGAAGACGG TTATTATACC
GACAAAATTT TGGCGGACAT AGTAAGCAGC ATCAAGGCAA AATATCCCGA TTGTGCGATT
ACTCTCTCTT TGGGTGAAAA AAGTTATGAA AGCTATAAAT TGCTTTATGA GGCTGGAGCG
GACAGATACC TTCTTCGCCA TGAAACAGCA AATGCCCAGC ACTACTCAAA GCTTCATCCG
CCTGTTATGT CCCTTAAAAA CAGAAAACAA TGTCTTTACA ATCTCAAAGA AATAGGATAC
CAGGTAGGTT GCGGTTTTAT GGTCGGTTCA CCGTTTCAGA CCACGGAATG TCTCGTTGAT
GACTTAATGT TTATAAAAGA ATTGCAGCCC CACATGGTGG GAATAGGTCC GTTTATCCCG
CACAAGGATA CGCCTTTTGC CGGCAAACCC GCCGGTACCC TGGAGCTGAC ATTGTTCCTT
CTCGGCATCA TACGGCTAAT GCTTCCCTAC GTTCTGCTTC CGGCCACCAC AGCCCTTGGC
ACAATCCATC CCAAAGGCAG GGAACTGGGT ATTCTTGCAG GCGCAAACGT GGTAATGCCA
AACCTTTCGC CGAAAGAAGT AAGAAGCAAG TATCTTTTAT ATGACAATAA AATCTGTACC
GGGGATGAAG CCGCAGAATG CAGAATGTGC CTAACCCACC GTATTGAAAG CATCGGATAC
AAACTGGTTG TGTCAAGAGG CGACTGCAAA AAGCCAAATT AA
 
Protein sequence
MTNMTNMINL IDKLSTTHTL SYDEMYQLIE HRNEELANYL FEKARQVRIL YYGHDVYMRG 
LIEFTNYCRN DCYYCGIRKS NCNAERYRLT KEQILECCDV GYELGFRTFV LQGGEDGYYT
DKILADIVSS IKAKYPDCAI TLSLGEKSYE SYKLLYEAGA DRYLLRHETA NAQHYSKLHP
PVMSLKNRKQ CLYNLKEIGY QVGCGFMVGS PFQTTECLVD DLMFIKELQP HMVGIGPFIP
HKDTPFAGKP AGTLELTLFL LGIIRLMLPY VLLPATTALG TIHPKGRELG ILAGANVVMP
NLSPKEVRSK YLLYDNKICT GDEAAECRMC LTHRIESIGY KLVVSRGDCK KPN
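
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage; the helper names are hypothetical, and the example input is only the last line of the gene sequence, not the full record.

```python
def clean_blocked_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence as printed in the record.
last_line = "AAACTGGTTG TGTCAAGAGG CGACTGCAAA AAGCCAAATT AA"
tail = clean_blocked_sequence(last_line)
print(len(tail), tail[-3:])  # prints: 42 TAA
```

Passing the whole gene sequence block through `clean_blocked_sequence` and then `gc_percent` would give a value to compare against the listed GC content.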