Gene Daro_3930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3930 
Symbol 
ID3567661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4225534 
End bp4226541 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content60% 
IMG OID637682404 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_287128 
Protein GI71909541 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones69 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATTC TCGTGACCGG CGCTGCCGGA TTCATCGGCA TGACTACCTC GCTGCGCCTG 
CTGGCCCGCG GCGATGAGGT GGTCGGTCTC GACAACATGA ACGACTACTA CGAAGTGTCG
CTCAAGGAAA ACCGTCTGAA GCGCCTGACT GCCCTCCCCG GCTTCCGCTT CGTCAAACTG
GATGTGGGCG ACCGGGCCGG CATGGAAAAG CTGTTTGCCG ACGAGAAATT CGACAAGGTC
ATCCACCTCG CCGCCCAGGC CGGCGTCCGC TATTCGATCC AGAACCCGCA CGCCTACGTC
GATAGCAATC TGGTCGGCTT CATCAATATT CTTGAAGGTT GCCGCCACCA CAAGGTTCAG
CATCTGGTTT ACGCCTCCAG TTCCAGCGTC TATGGCGGCA ACACCAAGAT GCCCTTCTCC
GAGCACGATA GCGTCGACCA CCCGGTCAGC CTTTACGCCG CAACCAAGAA GGCCAACGAG
CTGATGGCCC ACACCTACAG CCACCTCTAT GGCTTGCCGA CGACCGGCCT GCGCTTCTTC
ACGGTCTATG GCCCGTGGGG ACGTCCGGAC ATGGCGCTCT TCCTGTTCAC CAAGGCCATC
CTCGAAGGCC GGCCGATCGA TGTCTTCAAC CACGGCAACA TGAAACGCGA CTTCACCTAC
GTAGACGACA TCGTCGAAGG TGTCATCCGC GTGATGGATC GCAATGCCGC GGCCAATGCC
GAATACGACT CGCTCTCCGC CGATCCGGCG ACCAGCAATG CCCCCTACCG GGTGTTCAAC
ATCGGCAACA ACAATCCGGT GCAGTTGCTC GACTTCATCG GCGCCATCGA AACCGCCCTT
GGCCAGAAGG CCGAAAAGCG CCTGTTGCCG CTGCAGGACG GCGATGTCCC TGCCACCTAC
GCCAACACCG ATCTGCTCAA TGACTGGGTC GGCTTCGTCC CTGGCACCTC CGTTCAGGAA
GGTGTCAGCA AATTCATCGC GTGGTATCGC GACTACTACA AGGTTTAA
 
Protein sequence
MKILVTGAAG FIGMTTSLRL LARGDEVVGL DNMNDYYEVS LKENRLKRLT ALPGFRFVKL 
DVGDRAGMEK LFADEKFDKV IHLAAQAGVR YSIQNPHAYV DSNLVGFINI LEGCRHHKVQ
HLVYASSSSV YGGNTKMPFS EHDSVDHPVS LYAATKKANE LMAHTYSHLY GLPTTGLRFF
TVYGPWGRPD MALFLFTKAI LEGRPIDVFN HGNMKRDFTY VDDIVEGVIR VMDRNAAANA
EYDSLSADPA TSNAPYRVFN IGNNNPVQLL DFIGAIETAL GQKAEKRLLP LQDGDVPATY
ANTDLLNDWV GFVPGTSVQE GVSKFIAWYR DYYKV