Gene Ddes_0628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDdes_0628 
Symbol 
ID7284300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 
KingdomBacteria 
Replicon accessionNC_011883 
Strand
Start bp756204 
End bp757484 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content64% 
IMG OID643581423 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002479215 
Protein GI220903903 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACCA CACTTTTTTC GCAGAATACC GCATTGCGCG GTCTTCTGGA CTCCCATCTG 
GACAGCCTGT CCGCTGAAGA GCAGCTCGAC GCCCAAAGCG TGACCGCCGC CCTTGAGGCC
GGAACTATGG TGCTGCTGGG CAACCCCGCC CATAAGGGAC TCAAACCCAT TCTTGTGGGT
CAGCCCGCCA GGGTCAAGGT GAACGCCAAT ATCGGCACGT CTCCCCTGAA CAACTGCCCC
CGCACCGAGG AGCGCAAGAT ACAGGCCGCC CTTGAAGCCG GAGCCGATAC GGTTATGGAT
CTTTCCATCG CCGGCGATCT GGACGCCCTG CGCCTCGGCA TGCTGGAGGC CTGCCCCCGG
CCGCTGGGTA CGGTTCCCCT GTACGCCGTG GGCCAGCGCA TACTGGACGC GGAGCGCGAC
ATTGCCAGCA TGAACCCGGA CGAGCTTTTT GACGAAATCG CCAAACAGGC CCGGCAAGGG
GTGGATTTTG TCACCGTTCA CTGCGGGCTT TCGCGGCGCG GAGCCGAAAT GGCCGTCAAA
AACAACCGCG CTCTGGGCAT TGTCTCGCGC GGCGGCTCCA TGCTGGCCCG CTGGATGCTC
GAAAACGACC GCGAAAACCC CTTGCTGGAG CATTTTGACC GCCTGCTCGA CATCTGCCTT
CCCTACAACG TGACCCTCTC CCTCGGCGAC GGACTGCGCC CCGGCGCGGG CGTTGATGCG
GGCGATGCCG CCCAGTGGGA AGAGGTCATC AATCTGGGGC GCCTTGCCAG ATACGCGCTT
GAACGCGGCG TGCAATGCAT GATTGAAGGC CCCGGCCACG TGCCCCTCAA CCAGGTGCGC
ACCCAGATTC AGGGCATAAA GCGCCTCACC CATAATGCCC CGCTCTATGT TCTCGGCCCC
CTGTGCTGCG ACAGCGCGCC GGGCTACGAC CATATCGCCG GAGCCATCGG CGGCGCACTA
GGCGTGGAAG CAGGCGTGGA CTTTCTCTGC TACCTCACCC CTGCCGAGCA CCTCACCCTG
CCTGACGAGG CAGACGTGCG CGCCGGGGTC ATGGCTTCGC GCGTGGCCGG GCATGTAGGC
GAAGTGGCTT TGGGCCACCC CCGCGCCGTG GCACGTGAGG CAGCCATGAA CGCCGCTCGC
AAGGCGCTGG ACTGGGAAGG CATGACCAAA GCCGCCCTGG ATCCACAAAT GCTTGAAAAA
CGGCGTGAGG AGCACAAGAC TGAAGAAGTC TGCGCCATGT GCGGCAAGTT CTGCGCGGTC
AAGATGCTTC AGGATCACTA G
 
Protein sequence
MSTTLFSQNT ALRGLLDSHL DSLSAEEQLD AQSVTAALEA GTMVLLGNPA HKGLKPILVG 
QPARVKVNAN IGTSPLNNCP RTEERKIQAA LEAGADTVMD LSIAGDLDAL RLGMLEACPR
PLGTVPLYAV GQRILDAERD IASMNPDELF DEIAKQARQG VDFVTVHCGL SRRGAEMAVK
NNRALGIVSR GGSMLARWML ENDRENPLLE HFDRLLDICL PYNVTLSLGD GLRPGAGVDA
GDAAQWEEVI NLGRLARYAL ERGVQCMIEG PGHVPLNQVR TQIQGIKRLT HNAPLYVLGP
LCCDSAPGYD HIAGAIGGAL GVEAGVDFLC YLTPAEHLTL PDEADVRAGV MASRVAGHVG
EVALGHPRAV AREAAMNAAR KALDWEGMTK AALDPQMLEK RREEHKTEEV CAMCGKFCAV
KMLQDH