Gene Daro_2379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2379 
Symbol 
ID3568593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2564179 
End bp2565699 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content65% 
IMG OID637680846 
Producthypothetical protein 
Protein accessionYP_285585 
Protein GI71907998 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAT CCAGCCTCTA CCTGAGCACC ACGCTGCGTC GCGTTGAAAC CCGGCACGCC 
ACTGAGCCAC TGATGCAACG CGCCGGTGCC GCCGCAGCCG AATGGGTGGC AACGCTGGCC
GGCGACCAGA ATCATCCTGT CCTTGTCCTG GCTGGCCCCG GCAACAATGG AGGCGACGCA
TTCGAAGTGG CGCGTCTGTT GCAGAAACGG TTTTTCGATG TCTGCGTCGT GTTTGCCGCG
CCCCCGGACA AGCTCCCCGC TGACGCCGCT TCAGCCCGCC AGCGCTACGT GGCCGCCGGT
GGCGTCACAA CCCATGCGAT CCCGGAAGAG ACTCGCTGGT CGCTGATCGT CGACGGTCTG
TTCGGCATCG GACTGACCCG CGCACCGGAA GGCGACTACG CCCAATGGAT TACAGCCGCC
AATCGGCTCG CCGAACGTGA CCACTGTCCA TTGCTCGCCC TCGACTGTCC GTCCGGCCTC
AATGCCGATA CCGGCATCGC CCAGGCGCCG AGTATCCATG CCACTCACAC GATCACCTTC
ATTGCCGCCA AACCCGGCCT ATTCACGGCC GACGGGCCAG ACCACTGCGG TGAAATCCGT
GTCGCCAGCC TTGATCTCGA CCCCGTCAGT GAAATCCCAC CAGATGGACA ACGGCTCAGC
CTGGCCGACG TTGATTCGCG CCTGAAACCC CGCCGCAACA ATACGCACAA GGGCAGTTTT
GGCAGCGCCG GCATCCTCGG TGGCGCCAAG TCCATGGTCG GCGCCGCCTT CCTGGCTGGC
CGGGCTGCGC TCAAGATGGG CGCCGGACGC GTCTATCTCG GCATGCTCGA CCCTGAAGCG
CCGTCAGTCG ATCTCCTTCA ACCCGAACTG ATGATGCGCC GGGCTGATGC CCTGCTCCAG
GCCGACTTGC AAGCGCTGGC CTGCGGTCCC GGCCTCGGTC GTTCAGCCGA AGCCCTCCGC
CTCCTTGAAC AGTCGCTGAA GGCCCCCGTC CAGCTGGTTC TAGATGCCGA TGCGCTGAAT
CTGCTGGCCG AAGACAGCCG CCTCGAAGGC AAGCTCTACA ATCGCGTCGG GCCAGCCATC
CTGACCCCGC ATCCAGCCGA AGCTGCCCGC CTGCTCGGCT GTTCGGTCCG CGACATTCAA
AGCGACCGCA TCAAGGCTGC CCGTGAACTG GCCGAGCGCT ATCGCAGCCA CATTGCCCTC
AAGGGGTGTG GCACCCTCAT TGCGACGGTC GATGGTCGCT GGTGGATCAA CACCACCGGC
AACCCCGGCA TGGCCACTGC CGGCATGGGC GATGTACTGA GCGGCCTGAT CGTCGCCCTG
CTTGCCCAGA ACTGGCCGCC GGAAATGGCC CTGCTCGCCG CCGTCCATCT GCATGGCGCG
GCAGCTGATC GCCTGGTTGC CCATGGCCGC GGTCCGATCG GCCTGACGGC TGGCGAAATC
ATCGACACCT CACGAGATAT ATTCAACGAG TGGGTAGCCG TCCCCCCTGC CTGTAAAATG
TTCGCCTCAA ACGCTGCCTG A
 
Protein sequence
MNPSSLYLST TLRRVETRHA TEPLMQRAGA AAAEWVATLA GDQNHPVLVL AGPGNNGGDA 
FEVARLLQKR FFDVCVVFAA PPDKLPADAA SARQRYVAAG GVTTHAIPEE TRWSLIVDGL
FGIGLTRAPE GDYAQWITAA NRLAERDHCP LLALDCPSGL NADTGIAQAP SIHATHTITF
IAAKPGLFTA DGPDHCGEIR VASLDLDPVS EIPPDGQRLS LADVDSRLKP RRNNTHKGSF
GSAGILGGAK SMVGAAFLAG RAALKMGAGR VYLGMLDPEA PSVDLLQPEL MMRRADALLQ
ADLQALACGP GLGRSAEALR LLEQSLKAPV QLVLDADALN LLAEDSRLEG KLYNRVGPAI
LTPHPAEAAR LLGCSVRDIQ SDRIKAAREL AERYRSHIAL KGCGTLIATV DGRWWINTTG
NPGMATAGMG DVLSGLIVAL LAQNWPPEMA LLAAVHLHGA AADRLVAHGR GPIGLTAGEI
IDTSRDIFNE WVAVPPACKM FASNAA