Gene Daro_1257 details

Gene Information

Locus tag: Daro_1257
Symbol:
ID: 3569364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1364952
End bp: 1366049
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 62%
IMG OID: 637679723
Product: NAD-dependent epimerase/dehydratase:polysaccharide biosynthesis protein CapD
Protein accession: YP_284482
Protein GI: 71906895
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
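
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1364952, 1366049)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both results agree with the Gene Length and Protein Length fields of the record.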


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGT TCGGTGACTT CTACCGCGGG CGCCGGGTCC TGATCACGGG ACATACCGGC 
TTCAAGGGAT CCTGGCTCGC ACTATGGCTG CTGCAACTGG GCGCCGAGGT GGCAGGCTTT
TCGATTGACG TGCCTTCCAA TCCTTCCAAT TTTGAACTGC TGGGCTTGAA GGAGCGCTTG
CGCCACTACA GCGGCGACGT CTGCCAGCTT GGCCAGCTCG AGCAGGCGAT CGATGAATTC
CAGCCGGAGA TCATTTTTCA CCTAGCTGCC CAGGCCTTGG TGCGGCGCTC TTACCAGGAT
CCGCGCGGCA CTATCGAGAC CAACGTGATG GGCATGGTCA ACGTCCTGGA ACTGGTCCGT
AGCCGACCTT TCATCAGGAC AGCCGTCCTG ATCACGAGCG ACAAGGCCTA TCGAAACGAC
GAATGGTGCT GGGGCTATCG CGAAACGGAC GCCCTCGGCG GACATGATCC CTACAGCGGC
TCCAAGAGCT GCGCCGACCT AGTGGCGCAA ACTTACATCC ACTCCTATCT GCGCAGTTCC
GGCAAGCGGG TGGCAATTAC TCGCGCCGGC AACGTGATCG GCGGCGGCGA CTGGGCCAGC
GATCGCATCG TGCCCGACTG CATCCGCGCC TGGACCGAGG GCGGTTCAGT AGAAGTGCGC
AGTCCATCCG CCACCCGCCC CTGGCAACAT GTGCTCGAGC CTCTGGGAGC CTACCTCTGG
CTCGGAGCCT CGCTGCAGCG TGACGAGCGC ATCAATGGCG AGGCCTTCAA TTTCGGTCCG
GCGGCTCACG TCAACCAGAG CGTAGGGGAG TTGCTTGACG CCATGGCCCA GCGCTGGCCA
GGCGCCCAAT GGAACTCGCC TGCCAATGCG GGCCGGCCCG CTCAAACCGA AGCCACGCTG
CTGAAGCTCT CCTGCGACAA GGCGCAGGCC TACCTGGACT GGCGCGCGGT ACTCGATTTC
AGCGACACGG TGGAGCTCAC CGTCGACTGG TATCGAAACT GGTACGAAGG GAAGCAGGAC
ATTTTCGCCT ATGCGCTGCG CCAGGTTGAG GCCTACTGCG ACCTGGCCAA AGTCAGGGGA
GCGAAATGGC TGACGTGA
 
Protein sequence
MKQFGDFYRG RRVLITGHTG FKGSWLALWL LQLGAEVAGF SIDVPSNPSN FELLGLKERL 
RHYSGDVCQL GQLEQAIDEF QPEIIFHLAA QALVRRSYQD PRGTIETNVM GMVNVLELVR
SRPFIRTAVL ITSDKAYRND EWCWGYRETD ALGGHDPYSG SKSCADLVAQ TYIHSYLRSS
GKRVAITRAG NVIGGGDWAS DRIVPDCIRA WTEGGSVEVR SPSATRPWQH VLEPLGAYLW
LGASLQRDER INGEAFNFGP AAHVNQSVGE LLDAMAQRWP GAQWNSPANA GRPAQTEATL
LKLSCDKAQA YLDWRAVLDF SDTVELTVDW YRNWYEGKQD IFAYALRQVE AYCDLAKVRG
AKWLT
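
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used here as sample input; pasting the full gene sequence in its place would give the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGCAGT TCGGTGACTT CTACCGCGGG CGCCGGGTCC TGATCACGGG ACATACCGGC"
seq = clean_sequence(first_line)

print(len(seq))                         # 60 (six blocks of ten bases)
print(round(gc_fraction(seq) * 100))    # GC percentage of this fragment only
```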