Gene Daro_3418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3418 
Symbol 
ID3568314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3671192 
End bp3672190 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content65% 
IMG OID637681890 
ProductKpsF/GutQ 
Protein accessionYP_286617 
Protein GI71909030 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.000032508 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.585789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCAA GCCCTAAACC CCATCGCTTT TCGCCGGAAC GCGCTCTGGA ACTGGGTCGC 
CAGACCCTGA GCATCGAGGC TGCCGCCGTA GAGGCCCTGC AGGGCCGAAT CAACGGCGAT
TTCGCCAAGG CCGTCGAGCT GATACTCAAC AGCCACGGGC GTCTGATCGT CAGCGGCATG
GGCAAGTCCG GCCATATTGC CCGCAAGATC GCCGCGACCA TGGCCAGCAC CGGCACCCCG
GCCTACTTCG TCCATCCGGC CGAAGCCAGC CATGGCGATC TCGGCATGAT CACTCGCGAT
GACGTGCTGC TTGCCCTGTC GAACTCCGGC GAATCCGGCG AACTGCTCAG CATCCTGCCC
GCACTGAAGC GCCAGGGCGC CAAGATCATC TCAATGACCG GCGTTCCGAC CTCAACGCTG
GCTCGTGAAG CCGACATTCA TCTCGACGCC GGCGTCGAAC AGGAAGCCTG CCCGCACAAT
CTGGCCCCCA CGGCCAGCAC CACGGCGGCA CTCGCCCTGG GTGATGCCCT GGCTGTCGCC
CTGCTCGATG CCCGTGGCTT CGGGCCGGAA GATTTCGCCC GTTCGCACCC CGGCGGCTCG
CTCGGCCGCC GCCTGCTGAC CCATGTTCGC GATGTCATGC GCGCCGACGA CAAGGTTCCC
GCCGTCACTC CGGCCACCCC CATCACCGAC GCGATCATCG CCATGTCGCG TGGCGGCCTC
GGGCTCGTCG CAATCACCGA TCCGGCCAAT ATCGTCCTCG GCATCTTTAC CGACGGTGAC
CTGCGTCGCG CTTTCGAAAA ACGCATCGAC CTGCAACAGG GCGACATTGC CTCGGTCATG
CACGCCGCGC CGCGCACCAT CGGCCCCGAC CGCCTGGCCG TCGAAGCCGT CGAAATGATG
GAGCGCCTGC GCATCAACGC CCTGCTCGTC GTCGATGCTG AAAATCACCT GATCGGTGCG
CTGAACATGC ACGATCTCTT CACTGCCAAG GTTATCTGA
 
Protein sequence
MNPSPKPHRF SPERALELGR QTLSIEAAAV EALQGRINGD FAKAVELILN SHGRLIVSGM 
GKSGHIARKI AATMASTGTP AYFVHPAEAS HGDLGMITRD DVLLALSNSG ESGELLSILP
ALKRQGAKII SMTGVPTSTL AREADIHLDA GVEQEACPHN LAPTASTTAA LALGDALAVA
LLDARGFGPE DFARSHPGGS LGRRLLTHVR DVMRADDKVP AVTPATPITD AIIAMSRGGL
GLVAITDPAN IVLGIFTDGD LRRAFEKRID LQQGDIASVM HAAPRTIGPD RLAVEAVEMM
ERLRINALLV VDAENHLIGA LNMHDLFTAK VI