Gene Dfer_2462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2462 
Symbol 
ID8226034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp3031780 
End bp3032898 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content54% 
IMG OID644930294 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_003086845 
Protein GI255036224 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTA AGGAAGAATT CGCGCGGTAT TCGTGGGACA ACGTGAAGCA GGATATTTAT 
TCCAAAACGG CGGCCGACGT GGAAAGAGCA CTCATAGCCC CGAAAAGGAC TTTGGAAGAT
TTCAAAGCAT TGATTTCGCC CGCCGCGGCC GCTTACCTGG AACCCATGGC CAGGCTGAGC
CGGCAGCTGA CGCGCAAGCG CTTCGGCAAC ACCATTCAAA TGTACGTGCC GCTGTATTTG
TCCAACGAGT GTACCAATAT ATGTACCTAC TGCGGGTTCA GCCTGGATAA TAAAGTGCGG
CGTCGCACGC TGACAGAGCG TGAAATACTG CAAGAAGTGG AGGTGATTAA AGGCATGGGC
TACGAACACG TGCTGCTGGT GACCGGCGAG GCGAACCAGA CGGTGCACGT GGACTATTTC
AAAAAGGTGT TGGCCCTGAT CCGTCCGCAT TTCGCGCAGG TATCCATGGA AGTACAGCCG
CTCGACCGCG ATGAATACGA GGAACTGATC CCGCTCGGCC TGCATTCGGT GCTGGTGTAC
CAGGAAACTT ATCACCAGGA AGATTACCGC AAACACCACC CGAAAGGCAA AAAAGCGAAT
TTCAACTACC GCCTCGAAAC GCCCGACCGT CTCGGGCAGG CAGGGATTCA CAAAATGGGC
CTCGGCGTGC TGATCGGCCT GGAAGACTGG CGTACCGACA GTTTTTTCAC TGCCTTGCAT
TTGCATTATC TTGAAAAAAC CTACTGGCAA ACGCGTTACA GCCTATCCTT TCCGCGCCTG
CGCCCATTCT CCGGCGGCCT CGAACCGAAA GTGGAAATGA GCGACCGCGA ACTGGTGCAG
CTGATTTGCG CCTACCGCAT ATTTGACGAG GAAGTAGAAC TATCCCTCTC CACACGCGAA
TCGGACCGAT TCAGGAACCA TTGCATTCAA TTGGGTGTAA CATCCATCAG CGCCGGCTCC
AAAACCAACC CCGGAGGCTA CGCCGTAGAA CCGGAATCAC TCGAACAATT CGAAATTTCC
GACGAACGAA GCCCCGCGGA AATAGCACAA ATGATCCGCC AAGCCGGCTA CGAGCCAGTT
TGGAAGGATT GGGACGCTGG GCTAATTTTG AATGAGTGA
 
Protein sequence
MSFKEEFARY SWDNVKQDIY SKTAADVERA LIAPKRTLED FKALISPAAA AYLEPMARLS 
RQLTRKRFGN TIQMYVPLYL SNECTNICTY CGFSLDNKVR RRTLTEREIL QEVEVIKGMG
YEHVLLVTGE ANQTVHVDYF KKVLALIRPH FAQVSMEVQP LDRDEYEELI PLGLHSVLVY
QETYHQEDYR KHHPKGKKAN FNYRLETPDR LGQAGIHKMG LGVLIGLEDW RTDSFFTALH
LHYLEKTYWQ TRYSLSFPRL RPFSGGLEPK VEMSDRELVQ LICAYRIFDE EVELSLSTRE
SDRFRNHCIQ LGVTSISAGS KTNPGGYAVE PESLEQFEIS DERSPAEIAQ MIRQAGYEPV
WKDWDAGLIL NE