Gene Daud_0471 details


Gene Information

Locus tag: Daud_0471
Symbol:
ID: 6025788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 507570
End bp: 509168
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 70%
IMG OID: 641593311
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_001716649
Protein GI: 169830667
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
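
The listed gene and protein lengths follow from the coordinates by simple arithmetic. The sketch below checks this, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates as listed in the record above.
START_BP = 507570
END_BP = 509168


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1599
print(protein_length(gene_length(START_BP, END_BP)))  # 532
```

Both values agree with the Gene Length and Protein Length fields.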


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGTGG TCACCGCGGC CGAAATGCGG GAGATAGACC GGCGGGCCAC CGAGGAGTAC 
GGCGTCTTGG GACTCGTCCT TATGGAGAAC GCGGGCCTTA AGGTGTTTGA GTGTGTGCGC
CGGGTCCTGG GCGGGGTGGA CGGGAAACAG GTGATAGTCC TGGCCGGAAA AGGGAACAAC
GGCGGAGACG GGTTGGTGGC CGCCCGCCAC CTGCTGCAGC ACGGGGCCCG GGTGAAAGTA
ATGCTTAGCG GCGAACCCGC GGATGTGACG GGCGATGCGG GCATTAACCT GGAGATCTGG
AAGCGGTTGG GGCAGCGGCT GTACCTGATG CAGGACCGTA ACGCCATTCA GCTGCTGCAA
CTTGCCCTGA TGCAGACGGA CCTGGTGGTG GACGCGCTCT TCGGCACGGG TTTCCGTGGC
GAGATCAGGG ACCGGGCCCG CAAAGTTATC GAGGCCGTCA ACGAGTCGGG CAAACCGGTG
GTGGCCGTGG ACATCCCGTC CGGGGTGGAG GCCGACACCG GCGCGGTGCG CGGGCCGGCC
ATCCAGGCGA CCCACACGGT CACGTTCGGT CTGCCCAAGC TCGGGCTGGT CCTGGAACCG
GGGGCGGGGA GGACCGGCGA GCTGCACGTG GCGGACATCT CCCTGCCGCG GCCGCTGGTG
GAGGCGGAGG GTGGCCGTTA CCTGCTCACT CCGGCGCTCG TTCGGGACTG GCTGCCCCGA
CGCGAGGCGG AGGCGCACAA GGGACGATTC GGTCACGTTC TGTTGGTGGC GGGGTCGAGG
GGGATGGTCG GCGCGGCCGT TCTGGCCGCC CGCGCGGCGG CTCTGACGGG TGCCGGGTTG
GTTACCCTGG CGGTGCCCCG CAGCATCCAG AACGTGGCCG CCGGTTTCCA GCCGGAGATT
ATGACCCTGG GATTGCCCGA GACCGGCGCG GGAACCCTGA GCCGGGCAGC CCGGGAGCAG
ATCGAGGAGT TCCTGCCGCG TGCCTCCGTG CTCGCCCTGG GTCCGGGACT CACCACCCAC
CCGGAGACGG CGGAACTGGT CCGGGAGCTT TTGCCCGGGG TGCGGGTGCC GTGCGTCCTG
GACGCCGACG GCCTGAACGC CTTCGGGGGT GGGGAACGGG AGACCGACCG GAATTCGGCC
GGGACGCCCG TCGGAGAGAG CCTCCCGCCC GGCGGCTTCC GGGAGAAACC CGACCTGGTG
CTCACGCCGC ACCCGGGGGA AATGGCGCGG CTTCTGGGTT TGAAGAGCGC GGCCGAAGTG
CAGGCCGACC GGCTCGGTGT CGCCGAGCGC ACGGCCGCCG CCTGGCGGTG CACGGTGGTG
CTGAAAGGGG CCCGCACGCT GGTGGCCGAA CCGGGCAAGA CTTACATCAA CCCGACGGGG
AACCCCGGGA TGGCCACGGG CGGAACCGGA GACGTTCTGA CCGGGGTGAT TGCCGGGCTT
CTGGCGCAGG GTCTGGAACC CGGTCCAGCG GCGGCCGCCG CCGCCTTCCT GCACGGGCGG
GCCGGCGACC TGGCGGCCGC CGAACGTGGG CAGGCGTCCC TACTGGCCGG AAACCTGCTG
GAGTATCTGC CCGCAGCTTT CCACGAACTC GGCGCCTGA
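
The GC content field can be checked against the gene sequence. A minimal sketch, assuming GC content means the percentage of G and C bases in the sequence above (how the source database computes or rounds the figure is not stated); `gc_content` is an illustrative helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First row of the gene sequence, copied with its 10-base spacing.
first_row = "ATGCGGGTGG TCACCGCGGC CGAAATGCGG GAGATAGACC GGCGGGCCAC CGAGGAGTAC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full sequence, with its spaces and line breaks left in, gives the gene-wide value to compare with the listed figure.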
 
Protein sequence
MRVVTAAEMR EIDRRATEEY GVLGLVLMEN AGLKVFECVR RVLGGVDGKQ VIVLAGKGNN 
GGDGLVAARH LLQHGARVKV MLSGEPADVT GDAGINLEIW KRLGQRLYLM QDRNAIQLLQ
LALMQTDLVV DALFGTGFRG EIRDRARKVI EAVNESGKPV VAVDIPSGVE ADTGAVRGPA
IQATHTVTFG LPKLGLVLEP GAGRTGELHV ADISLPRPLV EAEGGRYLLT PALVRDWLPR
REAEAHKGRF GHVLLVAGSR GMVGAAVLAA RAAALTGAGL VTLAVPRSIQ NVAAGFQPEI
MTLGLPETGA GTLSRAAREQ IEEFLPRASV LALGPGLTTH PETAELVREL LPGVRVPCVL
DADGLNAFGG GERETDRNSA GTPVGESLPP GGFREKPDLV LTPHPGEMAR LLGLKSAAEV
QADRLGVAER TAAAWRCTVV LKGARTLVAE PGKTYINPTG NPGMATGGTG DVLTGVIAGL
LAQGLEPGPA AAAAAFLHGR AGDLAAAERG QASLLAGNLL EYLPAAFHEL GA