Gene Daro_3603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3603 
Symbol 
ID3568267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3865825 
End bp3867315 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID637682076 
Producthypothetical protein 
Protein accessionYP_286802 
Protein GI71909215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000547172 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAGCGCCA CGCACCGTAT CCGCCGCCAG CGCTGGCAGG TTCGCACTGC CAGCGCCGCC 
GATGCCTTTG CCGTGCGCAC GGCCCTGCGG CAGGAGAACG AAGTCAGCCT GCTGCCCGCG
CTGGAAAGCG TCTTTGCCGC GCTTGATGAT GGCGAGCGGG AAATCCATTT GCCGCGCCTC
GAACTGGCTA TCCGCATTTC ATTGCCCGAG CGGCTGGCCG AAGAGTTGCC GGTCATGCTG
GCCGAGGCTG CCCGTCAGGC GCTGGCCGAA GTCATCGATG TGCCGTCGCA GGAGCCGGCC
GCCATGCCGC GCAGCCTGAC GCCGGGCAAG CGATTGCGCC GCTATCTGGG CAGCGGCCAG
GTCGACTGGT TCGATGCCGA TCGTGAACAG TCCGAACTGC AGCAGCAACT GGCGGACGAA
GCCCGCCAGT GGAGCGCCTC GCCGGCCGCC GCCTGGCCCT GTCTGCTGGC CGATCTGCCA
GCCGGCGGAC AGGCGCGCGC CGATGCCTTC TTCCGCTTCC TGCAACTGCT CGATATCGCC
GACCGCCTGC GCTGGTGGGA TTTTGCCGCC CGTCTGGCCG AGGCCCACGA TGGCGAATCG
CCGGCCCGAC TGGCGCTGCT GCGTCAGATG GCGGCGACAC GCCCGGCTGA CCATGCACTC
CGCCTGCAGG CGCTCGGCTT GCTGGCGCTG TCGGCCGACC CGGCCCGTTC TTCCCGCCAT
CGCAGCGAGT GTTTGCGGGT GGCGCAGGCC TGTGCCGGGC AGCTGGAAAC CTTTGCTGCG
GTTGACCAGA AAAACTGGCT GAAAATCGAA AGCTGGCTCG GCGGCGAGAG CGGGGCAATA
ACGCTTGATC TGGATAACGG CAGCGCGAAG GCAGCAATAG ATGCAACGAG CCCTCCGCTA
CCGGAGGCAG CGGGAGTCGG GAAAGCACCG AATCTTTCGA CTGATCGTGA TGGTGAACCT
GGCCAGGCCC TTGGGCTGCC GCTGCGCTCG GCTGGATTGA TCCTGCTCCA TCCGTATCTG
CCAAGGTTGT TCGCGGCACT CGGCTGGGTC AGCGCCAGCC ATCCGCCCGG CGAGCCCTTC
CCGTGGGCCA GGTTGCCGCA TGCCGCCGCC TTGCTCAACT GGCTGGCCAC CGGCCGCGAC
GAGCCTTTCG AATTCGAACT CGGCACCGCC AAACTGTTGC TTGGCCTGCA GCCGGACGCC
CCGCTGCCAG TCGCCGCCGG CCTGATCGGC GATGCCGAAA GGGAGGAAGG CGAGGCCCTG
CTCGGCGCTG TCATCCTGCA CTGGTCTGCC CTCGGCCAGA CCACCGTCGA CGGCCTGCGC
GTCGCCTTCC TGCAACGCGG TGGCCTGCTT TATCCGGCGC CTGACGGCTG GCTGTTGCGG
CCGCAAGGCG AAACCTTCGA CCTCTTGCTC GACCGCTTGC CCTGGGGCTT GTCCATCATT
CGCCTGCCCT GGATGCCTAG CTCTCTCCAC ACCGAATGGC TGAGCGCCTG A
 
Protein sequence
MSATHRIRRQ RWQVRTASAA DAFAVRTALR QENEVSLLPA LESVFAALDD GEREIHLPRL 
ELAIRISLPE RLAEELPVML AEAARQALAE VIDVPSQEPA AMPRSLTPGK RLRRYLGSGQ
VDWFDADREQ SELQQQLADE ARQWSASPAA AWPCLLADLP AGGQARADAF FRFLQLLDIA
DRLRWWDFAA RLAEAHDGES PARLALLRQM AATRPADHAL RLQALGLLAL SADPARSSRH
RSECLRVAQA CAGQLETFAA VDQKNWLKIE SWLGGESGAI TLDLDNGSAK AAIDATSPPL
PEAAGVGKAP NLSTDRDGEP GQALGLPLRS AGLILLHPYL PRLFAALGWV SASHPPGEPF
PWARLPHAAA LLNWLATGRD EPFEFELGTA KLLLGLQPDA PLPVAAGLIG DAEREEGEAL
LGAVILHWSA LGQTTVDGLR VAFLQRGGLL YPAPDGWLLR PQGETFDLLL DRLPWGLSII
RLPWMPSSLH TEWLSA