Gene Daud_1687 details


Gene Information

Locus tag: Daud_1687
Symbol:
ID: 6026805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 1780357
End bp: 1781427
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 67%
IMG OID: 641594508
Product: glucose-1-phosphate thymidyltransferase
Protein accession: YP_001717819
Protein GI: 169831837
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
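
The coordinate and length fields above are consistent with each other, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the gene length includes a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1780357
end_bp = 1781427

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1071 356
```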


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGGCTC TGGTTTTATC CGGCGGGAAG GGGACGCGGC TGCGGCCCCT GACCTATACC 
ACGGCGAAGC AGCTTATCCC GGTGGCGAAT AAGCCCATCC TGCACTTCGT CCTGGAGCAG
ATCGCTACTG CGGGGATCGA GGACGTGGGG GTGATCATTT CGCCCGAGAC CGGCGGCATG
GTGCAGGATG CGCTCGGCGG CGGGGCGGGG TTCGGCCTGC GGCTGACCTT TATTGTGCAG
GACGAGCCCC TGGGCCTGGC GCACGCGGTC AAGACGGCCC GCGCTTTCCT CGGCGATTCG
CCGTTCCTGA TGTTCCTGGG GGACAACCTG GTGCAGGGCG GGGTGGCCCC GCTGGCGGCC
GATTTCCGGC GGGACACTTC CACGTCGATT ATTCAGTTGA AGAAGGTTCC CGACCCCCGG
GCCTTCGGGG TGGCGGTGCT GGACGGCGGC GGCAGGGTGG CGCGGCTGGT GGAGAAGCCG
AAGGAGTTCA TTTCCGACCT GGCGCTGGTG GGCATTTACG CCTTTTCTCC CGCCGTCCAC
GCGGCCATCG AACGGATTAA GCCGTCCTGG CGGGGAGAGC TGGAGATCAC CGACGCCATT
CAGGAGCTGA TTAACATGGG CCACGCGGTG GCGCCGCGCC TGCTGGAAGG CTGGTGGCTG
GACACCGGGA AGAAGGACGA CATCCTGGAG GCCAACCGGG TGGTGCTCGA CGAGTTCACC
CGCCGCCGGG TCGAGGGCAC GGTGGACGAG GCCTCGCAGG TCGTGGGCCG GGTGGAGATC
GAGGCCGGCG CCGTGGTGGA GAGGAGCGTC ATCCGGGGGC CGGCGGTGGT GGGGGCCGGG
GCGAAGATCG TAGACAGCTT CATCGGTCCC TACACGGCCA TCGGCCGGGG AACCGCCGTG
GAGGATTGCA GCGTGGAACA TTCCGTGATC CTGGATAACT GCCGGCTGCG GGCGGTGCAC
CACATCGAGG ACAGCCTCAT CGGCTCCGGT GCGCGCCTGA CGCGGGACGA TAGCCGCCGC
CGGGTTCTAC GCTTTTTTAT TGGCGACGAG TGCCAGATTA CCCTTAGTTA G
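
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is simply the share of G and C bases in the displayed sequence; the function name `gc_content` is hypothetical, and the database may round or compute the value differently.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First line of the gene sequence, with the display spacing left in.
first_line = "TTGAAGGCTC TGGTTTTATC CGGCGGGAAG GGGACGCGGC TGCGGCCCCT GACCTATACC"
print(round(gc_content(first_line), 1))
```

Passing the full 1071 bp sequence instead of the first line gives the figure to compare against the GC content field.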
 
Protein sequence
MKALVLSGGK GTRLRPLTYT TAKQLIPVAN KPILHFVLEQ IATAGIEDVG VIISPETGGM 
VQDALGGGAG FGLRLTFIVQ DEPLGLAHAV KTARAFLGDS PFLMFLGDNL VQGGVAPLAA
DFRRDTSTSI IQLKKVPDPR AFGVAVLDGG GRVARLVEKP KEFISDLALV GIYAFSPAVH
AAIERIKPSW RGELEITDAI QELINMGHAV APRLLEGWWL DTGKKDDILE ANRVVLDEFT
RRRVEGTVDE ASQVVGRVEI EAGAVVERSV IRGPAVVGAG AKIVDSFIGP YTAIGRGTAV
EDCSVEHSVI LDNCRLRAVH HIEDSLIGSG ARLTRDDSRR RVLRFFIGDE CQITLS
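
The protein sequence follows from the gene sequence under translation table 11, where the initial TTG codon is read as methionine. The sketch below illustrates this, assuming the standard NCBI table 11 codon assignments and start codons; `translate` is a hypothetical helper, not a tool used by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons allowed by table 11; a start codon is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence.
print(translate("TTGAAGGCTC TGGTTTTATC CGGCGGGAAG"))  # prints: MKALVLSGGK
```

The output matches the first ten residues of the protein sequence shown above.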