Gene Daro_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1600 
Symbol 
ID3568608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1716322 
End bp1717656 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content61% 
IMG OID637680068 
Producthypothetical protein 
Protein accessionYP_284819 
Protein GI71907232 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.0970446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.868802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA CGATGGGGCT GGCCCTGGCG GTCGGCACGG TGGTGGTCAT CCTGCTGCAG 
GTGGCCTTGC TGTTGCGGGA TAACCGCAAA CAGGGCGATG CAGAGCACTT CCGGACGCTG
CTGGACGCGC AGGAAAAAGG CTTGATGCGC CTTGAGCGCG AATTGCGCGA GGAGCTGGCC
CGGGGCCGGC GCGAGGATGC CGAAGAGGCA TTCCGCGACC GCGAGGAGCG AGCCCAATCA
GCCAATCTGC TGGGGCAGGC GATCACCACA CAGGTTGGCC AGTTTGGTAC CTTGCAGGCC
GAGCGCCTGG AGGCCTTTGC CCGTGAATTG AATCGTTTTT CGCTGGGGCT GGACGAGCGT
TTCGAGCGCC TCAAAACGAC CGTCGAGGGG CGCTTGACTG CCATTCAGAC GGACAATGCC
AACAAGCTCG AGGAAATGCG CCGTACCGTA GACGAGAAGC TGCATGCGAC TTTGGAGCAG
CGTCTTGGCG AATCCTTCAA GCTGGTTAGC GACCGACTGG AGCAGGTGCA CCGTGGTCTT
GGCGAAATGC AGACGCTGGC AGCCGGTGTT GGCGACTTGA AGCGCGTGCT GACCAATGTG
AAGACGCGTG GTACCTGGGG GGAAGTCCAG CTCTCAGCGC TGCTTGAACA GTTGCTGACG
GCCGATCAGT TTGCTTCCAA TGTTGCGACC CGCCCGGGTA GCAACGAGCG CGTCGATTTT
GCCATCCGCC TGCCGGGCAA GGACGACGGT GCAGTCGTCT GGCTGCCGAT CGACGCCAAG
TATCCGATCG AGGACTACCA GCGTCTGCTT GATGCTCAGG AGCGAGCAGA TCCGGCGGCG
GTCGAGGAGG CTTCGCGGGC CATCGAAACG CGGCTGAAGA GCGAGGCCAA GAGCATCCAC
GAGAAATACG TCTCGCCGCC GCATACGACT GATTTCGCCA TGCTCTACCT GCCGCTTGAA
GGCCTCTATG CCGAGGCGCT GCGCCGGCCG GGGCTGGCTG AGACGCTGCA GCGCGATTTT
CGGGTCAGTC TGGCCGGTCC GACGACCTTG GCCGCGCTGC TTAACAGCCT GCAGATGGGC
TTCCGTACGC TGGCTATCGA GCAACGTTCA GCTGAGGTCT GGGCCGTGCT TGGTGCAGTG
AAGACCGAGT TCGGCAAGTT TGGCGAGGCG CTGGCGCATA CCCGGAAAAA GCTGGACGAG
GCAAGCAACA GTATCGCCAA GGCGGAAACC AGGACCAGGC AACTGTCGCG CAAGTTGAAA
GAAGTCGAAG CGCTACCGGC GGCAGAATCT GAACAATTGA TCGGTGTGGT GGAATTTGAT
GGTGAAGACG AGTGA
 
Protein sequence
MSETMGLALA VGTVVVILLQ VALLLRDNRK QGDAEHFRTL LDAQEKGLMR LERELREELA 
RGRREDAEEA FRDREERAQS ANLLGQAITT QVGQFGTLQA ERLEAFAREL NRFSLGLDER
FERLKTTVEG RLTAIQTDNA NKLEEMRRTV DEKLHATLEQ RLGESFKLVS DRLEQVHRGL
GEMQTLAAGV GDLKRVLTNV KTRGTWGEVQ LSALLEQLLT ADQFASNVAT RPGSNERVDF
AIRLPGKDDG AVVWLPIDAK YPIEDYQRLL DAQERADPAA VEEASRAIET RLKSEAKSIH
EKYVSPPHTT DFAMLYLPLE GLYAEALRRP GLAETLQRDF RVSLAGPTTL AALLNSLQMG
FRTLAIEQRS AEVWAVLGAV KTEFGKFGEA LAHTRKKLDE ASNSIAKAET RTRQLSRKLK
EVEALPAAES EQLIGVVEFD GEDE