Gene Daro_3601 details


Gene Information

Locus tag: Daro_3601
Symbol:
ID: 3568265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3861238
End bp: 3864414
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 68%
IMG OID: 637682074
Product: hypothetical protein
Protein accession: YP_286800
Protein GI: 71909213
COG category:
COG ID:
TIGRFAM ID:
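
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3861238, 3864414)
print(length)                     # 3177
print(protein_length_aa(length))  # 1058
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of this record.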


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000186379
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAGCCG CCGCCCGCCT CGTCAATCGC CCGTCGCCAC CGATGCCGGC GGCTGCGGTC 
AAGGTGGCCG AGCCAGTGGT GGCGCAGACG GCGCCGATCC GCATCCAGTG CCAGAGCCTG
CGCGTTTCTA CACCCGGCGA TGCGGCCGAA GTCGAGGCCC GGCATGTCGC GCGCAGCATC
GTCAGCATGC CGGCGACTGC GGCAGCCCGG CCGGCCAGCG TGCTCAGTCC GCCCACGCTG
CATCGGGCCA ATCCGGTGCG CGGGCCGGCG GTTGCCGCGC CCACCCGGCC GCAGCAAACG
GCCGCCAAAC CAACGTCCAG CGGTGGCGAG CCGCTGCCCG AGGCGGTGCG CCAGGACATG
GAGTCGCGCT TCGGCGCCGA TTTCAGCGCC GTGCGCATCC ATCGCGATGC CCGCGCCGCG
CAAGCCAGCA CTGCCCTCAA CGCGGCGGCC TTCACGGTCG GCAACCAGAT CCATTTCGGG
GCCGGCCAGT TCAACCCGGG CAGCGGCGAA GGCCGCGAGC TGATCGCCCA CGAACTGACC
CACACCATCC AGCAGGGCGC CGCACCGCAA GCCCAACGGC AGATCCAGCG CAGCCCGGCC
CCGCTGGTCG TCGAGCGCAG TGCTCCAGCT GTCCAGCGCC TCGGCCTTGG CGATGCGCTC
AATTACTTTG CCGAGCACGC CAATTTCATC CCCGGCTTCC GGATGTTCAC GCTGGTCCTC
GGTGTCAATC CGATCAACAT GCGGGCCGTT GACCGCTCGC CGGGCAACCT GCTGCGCGCT
GTCGTCGAGC TGATGCCGGG CGGCGCGCTG ATCACCCGGG CGCTGGATGG CTACGGCATC
ATCGACCGCG TTGCCGGCTG GGTGCAGCAG CAGATGAGCA GCCTCGGGCT GGTCGCCAGC
AGCATCCGGC AGGCGGTCGA CCGCTTCCTC GACTCGCTGA GCTGGACCGA CATCTTCGAT
CTGGGCGGCG TCTGGGAACG CGCCAAGCGC ATCGTTACCG AGCCGATCGC GCGCATCACC
AGCTTTGTCG GCAATCTGGT TTCCGGCATC CTCCAGTTCA TCCGCGATGC CGTGCTGCGA
CCGCTGGCCG GGCTGGCGCA AGGGACGCGT GGCTGGGATC TGCTCTGCGC CGTACTCGGC
CGAAACCCGA TTACCGGCGA CGCGGTGCCG CGCACGGCCG AAACGCTGAT CGGCGGCTTC
ATGCGCCTGA TTGGGCAGGA GGAAGTCTGG CGCAATCTGC AGGAAAGCCG CGCCGTGCCG
CGGGCGATGG CCTGGTTCCA GGGCGCGCTG GCCGGCCTGA TGGGCTTCGT CACCGCCTTG
CCCGGCCTGT TCATGGACAC CCTGCGCAGC CTCGGCATCA ACGACCTGCT GACCCCGGTC
CAGACTTTCG GGCGCATCGT CCGCGTCTTC GGCGATTTCG CCGGGCGCTT CGTCGGCTGG
GCCGGTGCGC AGGTGATGAG CCTGCTCGAA ATCATCTTCG ACGTCGTCGC GCCGGCCGTG
ATGCCCTATA TCCGCCGGGC CGCCGGCGCC TTCCGGACCA TCGTCGCCAA TCCGGTCAAT
TTCGTCCGCA ATCTGGTCCG CGCCGCCGTT CAAGGCTTCC GCCAGTTCGC CAGCAACATC
CTGACCCACC TGCGCGCAGC GCTGATCGGC TGGCTGACCG GGGCGATGGG CGGCGCCAAT
ATCTACATTC CGCAGGCGCT GACGCTGCAG GAAATCATCA AGTTCGTGCT CAGCGTGCTC
GGCCTGACCT GGCAGAACAT CCGCAGCAAG CTGGTCCGCG CCGTCGGCGA AACGGCGGTC
AATGCCATGG AAACCGGCTT CGACATCGTC GTCACGTTGG TTCGCGACGG GCCGGCCGCT
GCTTGGGAAC GCATCCGCGA AAGCCTCTCC AACCTGCGCG AAATGGTCAT GGAGCAGATC
ATGGCCTTCG TCCAGAACAA CATCGTGATG GCCGCCGTGA CCCGGCTGGT CAGCATGCTC
AACCCGGCCG GCGCCTTCAT CCAGGCCATC ATCGCCATCT ACAACACGGT GATGTTCTTC
GTCGAACGCC TGCGCCAGAT CGCCCAGGTG GCCGCTGCCT TCATCGACTC GATTGCCGCC
ATCGCCGGCG GCGTCATCGC GGCTGCCGCC AACCGCGTCG AACAGACGAT GGCCGGGCTG
CTGACGCTGG TCATCAGCTT CCTCGCCCGA CTGGTCGGCC TCGGCCGGGT CAGCGATGCG
GTGACTAACA TCATCAACCG CATCCGCGCC CCGATCGACC GGGCGCTCGA TCGCGTGGTA
GCCTGGATCG TCAGTCTCGG CCGGCGCTTT ATGACCGCCG CCCGCAGCGC GGCCGGGCGC
GTCGCCGAAT GGTGGCGCCA GCGCAAGCCG TTCCGGACGG CCGGCGGCGA AAGCCACGAG
GTCTATTTCG TCGGCGACGA GCGCAATCCG CGCCCGATGG TGGCCAGCCG AGATCCGCAG
CCGGTCGAAA CGCGCCTCGA CCGTTTCCTG GCTGCGGCCA ACGAGTCCGG CGCCCCGGCC
CGCAAGCGCA ACGCCATCCC GCTGATCGGC GCGACGCGAA CGGCTGTGCG GGCCAACGCC
GATGATCCCA TCGTCGTCAC CAACCTGCGT ACCCTGTTCG GCATCTTCGA CGATCCCTCG
GCGCCGCGCG TCACCCGCTA CCAGCCGCGT ACCCAATCGC TGGGCGGCGA TACGGTCGGC
GTGGGCATGA CCATCGACTG GCTCAACGAC GCCTGGCGCC GAAGCCACCC CGGCAGCCCG
CCTCGCTCCG GCGCGCAGAG CACGCTGATG AGCAAGCTCG AAACCGATCC CGGTGAATCC
AGCCCCGACA AATACATCCG CGGACACCTG CTCAACGAAC ACATTGGCGG CATCGGTGAC
GCCACCAACC TCTTCCCGAT CACCGGCAAC GCCAACAGCC GCCACCTGCA CTCGACCGAA
AGCCGGGTCA AGCGTTGGGT CGATGTGCCC AGCAACTGGG TGTTCTACGA AGTTACGGTA
GACGGCATCA GCTCGCGGCT CAATGCCAGC GACGTCACCC AGAACTACGT CAACGCCACG
TTCAACTGCC GCGCCGTGCT CAAGGACGAC GACGGGACAG AAAAGGAAAG CTACATGACT
GCGATCAGTT CAACCTATCG CGTTCGCAAC GAAGCACGCG CCTTCCAGGT GCGCTGA
 
Protein sequence
MEAAARLVNR PSPPMPAAAV KVAEPVVAQT APIRIQCQSL RVSTPGDAAE VEARHVARSI 
VSMPATAAAR PASVLSPPTL HRANPVRGPA VAAPTRPQQT AAKPTSSGGE PLPEAVRQDM
ESRFGADFSA VRIHRDARAA QASTALNAAA FTVGNQIHFG AGQFNPGSGE GRELIAHELT
HTIQQGAAPQ AQRQIQRSPA PLVVERSAPA VQRLGLGDAL NYFAEHANFI PGFRMFTLVL
GVNPINMRAV DRSPGNLLRA VVELMPGGAL ITRALDGYGI IDRVAGWVQQ QMSSLGLVAS
SIRQAVDRFL DSLSWTDIFD LGGVWERAKR IVTEPIARIT SFVGNLVSGI LQFIRDAVLR
PLAGLAQGTR GWDLLCAVLG RNPITGDAVP RTAETLIGGF MRLIGQEEVW RNLQESRAVP
RAMAWFQGAL AGLMGFVTAL PGLFMDTLRS LGINDLLTPV QTFGRIVRVF GDFAGRFVGW
AGAQVMSLLE IIFDVVAPAV MPYIRRAAGA FRTIVANPVN FVRNLVRAAV QGFRQFASNI
LTHLRAALIG WLTGAMGGAN IYIPQALTLQ EIIKFVLSVL GLTWQNIRSK LVRAVGETAV
NAMETGFDIV VTLVRDGPAA AWERIRESLS NLREMVMEQI MAFVQNNIVM AAVTRLVSML
NPAGAFIQAI IAIYNTVMFF VERLRQIAQV AAAFIDSIAA IAGGVIAAAA NRVEQTMAGL
LTLVISFLAR LVGLGRVSDA VTNIINRIRA PIDRALDRVV AWIVSLGRRF MTAARSAAGR
VAEWWRQRKP FRTAGGESHE VYFVGDERNP RPMVASRDPQ PVETRLDRFL AAANESGAPA
RKRNAIPLIG ATRTAVRANA DDPIVVTNLR TLFGIFDDPS APRVTRYQPR TQSLGGDTVG
VGMTIDWLND AWRRSHPGSP PRSGAQSTLM SKLETDPGES SPDKYIRGHL LNEHIGGIGD
ATNLFPITGN ANSRHLHSTE SRVKRWVDVP SNWVFYEVTV DGISSRLNAS DVTQNYVNAT
FNCRAVLKDD DGTEKESYMT AISSTYRVRN EARAFQVR
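
Both sequences are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before use. The following is a minimal sketch of that clean-up plus a GC-fraction helper. The function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Strip all whitespace so the 10-character blocks form one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (no ambiguity codes handled)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, as printed in the record.
first_line = "ATGGAAGCCG CCGCCCGCCT CGTCAATCGC CCGTCGCCAC CGATGCCGGC GGCTGCGGTC"
seq = join_blocks(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Applied to the full blocks, the joined gene sequence should contain 3177 characters and the joined protein sequence 1058, matching the lengths listed under Gene Information.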