Gene Dred_3031 details


Gene Information

Locus tag: Dred_3031
Symbol:
ID: 4956660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum reducens MI-1
Kingdom: Bacteria
Replicon accession: NC_009253
Strand:
Start bp: 3290312
End bp: 3291451
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 44%
IMG OID: 640182220
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_001114359
Protein GI: 134300863
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
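
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3290312
END_BP = 3291451
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Both checks hold for this record: 3291451 - 3290312 + 1 = 1140,
# and 1140 / 3 - 1 = 379.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The `gc_percent` helper can be applied to the gene sequence in the Sequence section to compare against the listed GC content. The record's value is presumably rounded to a whole percent.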


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCTGATA CAGCAATAAA AAAACTAAAA CTAATGACCA TCCTCGGCAC CAGACCGGAG 
ATAATACGGC TGTCCGAAGT TATCAAAAAA TGCGATATTT ATTTTGACCA TATTCTGGTG
CATACCGGCC AGAATTGGGA TTACACGCTT AACCAGATTT TCTTTGAGGA TTTGGGTTTA
AGGGAGCCTG ATTACTATCT GGAGGCAGTC GGTGGAGATT TGGGTGAAAC CATTGGCAAT
ATTATAGCGA AAAGTTATAA GGTGCTCTCA GAAGTAAAAC CGGATGCACT CTTGATCCTG
GGCGATACAA ACTCAGCTCT GTCAGCCATA TCAGCCAAAA GACTGAAAGT GCCTATCTTT
CATATGGAAG CCGGCAACCG CTGTTTTGAT GAGAATCTAC CCGAAGAAAC CAATCGCAGA
ATTGTTGACC ATATCGCGGA CGTGAATCTG TGCTACAGCG AACACGCCAG GCGCTATTTG
AACTATGAAG GCGTAGCCAA AGAGCGCACG TATGTAACAG GGTCCCCTAT GGCAGAAGTG
CTCACCGCCA ATATTCAAAA AATTAAGCAC AGCAAGGTAG TTGAGCATTT GGGCTTGGAG
AAGGGGAAAT ATATTTTGCT TTCTGCCCAT CGTGAGGAAA ATATTGACAT TGAGGAGAAC
TTCTTGGCCT TGATGAACGC GGTCAATGCT ATGGCTGAGC ATTATGATAT GCCTATTATA
TACAGTACAC ATCCCCGCAG CGCCAAGTTT ATTGAACAGA GGGGCTTTAA ATTTCACCCC
TATGTGCGCA GTTTAAAACC TTTTGGTTTC TCGGATTATA ACAATTTGCA GTTAAACGCA
TTCTGTGTAG TATCAGATAG CGGCACTATA CCGGAAGAAG CATCCTATTT CAAATTCCCG
GCTGTATCTG TGCGAACCAG CACCGAGCGG CCGGAATCTA TGGATAAAGG TAATTTCATC
ATTGGAAGTA TCAGTACAGA GCAGGTACTG CAGGCAGTTG ACCTAGCTGT TGCAATGTAT
AAAAATGGTG ACCTAGGCGT CACAACACCT GACTATGCGG ATGAAAATGT AAGCGTGAAG
GTAGTCAAGA TTATTCAGAG TTATACGGGG ATTGTTAATA GGATGGTTTG GAGGAAGTGA
 
Protein sequence
MADTAIKKLK LMTILGTRPE IIRLSEVIKK CDIYFDHILV HTGQNWDYTL NQIFFEDLGL 
REPDYYLEAV GGDLGETIGN IIAKSYKVLS EVKPDALLIL GDTNSALSAI SAKRLKVPIF
HMEAGNRCFD ENLPEETNRR IVDHIADVNL CYSEHARRYL NYEGVAKERT YVTGSPMAEV
LTANIQKIKH SKVVEHLGLE KGKYILLSAH REENIDIEEN FLALMNAVNA MAEHYDMPII
YSTHPRSAKF IEQRGFKFHP YVRSLKPFGF SDYNNLQLNA FCVVSDSGTI PEEASYFKFP
AVSVRTSTER PESMDKGNFI IGSISTEQVL QAVDLAVAMY KNGDLGVTTP DYADENVSVK
VVKIIQSYTG IVNRMVWRK
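
The protein sequence begins with M even though the gene sequence begins with GTG, because the initiating codon is read as methionine. The following is a minimal sketch of how the gene sequence maps to the protein sequence. It assumes the standard codon assignments for internal codons and treats ATG, GTG, and TTG as start codons, a simplification of the full start-codon set for translation table 11. The names `CODON_TABLE`, `START_CODONS`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading the first codon as M if it is a start codon.

    Whitespace is ignored, translation stops at the first stop codon,
    and any trailing partial codon is dropped.
    """
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        codon = cleaned[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate("GTGGCTGATA CAGCAATAAA AAAACTAAAA"))  # MADTAIKKLK
```

The printed residues match the first ten residues of the protein sequence above.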