Gene Daro_3134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3134 
Symbol 
ID3567635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3377055 
End bp3378401 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content62% 
IMG OID637681605 
Productoxidoreductase molybdopterin binding subunit 
Protein accessionYP_286334 
Protein GI71908747 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value0.0673188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATA TCACTAAAAA GCCGGGGCGT TTGCGACCAG CGCCGGAGAA CTTCCTGTCG 
GAAGATCAGG TCAAGGCCGT CGGCGCCGGG CGTCGCGATT TTTTGCGCAA GAGCTTCCTG
GCGGCCGGCG CGGCGATGGC AGCACCGCTC GCCGTACGCG CCAGCGAAGG CGATCCGAAT
ATTCTCAATC TGCCGGCCTG GAGCACGTCT CTCGGCATGC CGGTGGCAAC CAATCCGTAC
GGTATGCCAT CGAAATTCGA GAGCCAGTTG GTGCGTCGCG AGTCGCCGGG TCTGGCCCGT
GTCGGCGGCG CCTCGGTGTC GTTTGCGCCG CTGCAGGGCC TGTTCGGCAT CATCACGCCG
TCCGGCCTGC ACTTTGAGCG CCATCACCAG GGCTGGCAGG ATGTCGATCC GACCAAGCAT
CGCCTGATGA TCAACGGTTC GGACGACTCG CTGCTCAAGA AGCCCAAGGT CTACACGATG
GACGAGTTGA TGCGCCTGCC CTCGGTGTCG CGCATCCATT TCATCGAATG CGGCGCCAAC
ACCGGTCTCG AATGGGGCAA TGTCGCCGTG CCGACTGTGC AATACACGCA CGGCATGCTG
TCCTGCTCGG AATTTACCGG TGTACCGCTC AAGCTGCTTC TGGAAGACTG CGGGGTCGAT
TACAAGAAGG CCCGCTATGT GCTGGCCGAG GGGGCTGACG GCTCCTCGAT GACACGTACC
ATTCCGATGG AAATGGTCGA GTCCGGGGAA GTGATCGTCG CCTACGGCCA GAACGGCGAA
ATGCTGCGCC CGGAAAACGG CTACCCGCTG CGCCTGGTCG TGCCGGGCGT GCAGGGCGTG
TCGTGGGTCA AGTGGCTGCG TCGCATCGAA GTCGGTGACA TGCCTTACGC CACCAAGGAC
GAGGCCGTGC ATTACATCGA CCTGCTGCCG AGTGGCCTGC ACCGCCAGTA CAGCTCGATT
CAGGAGGCCA AGTCGGTCAT CACCACGCCG TCCGGCGGCC AGACGTTGGT TGAAAAAGGC
TTCTTCAACG TTTCCGGCCT CGCCTGGTCA GGTCGCGGCC GCATCAAACA GGTCGATGTC
TCGTTTGATG GTGGCCTCAA CTGGCAGACG GCCCGTCTCG AAGGTCCAGT CCAGAACAAG
GCGCTGACCC GCTTCAATAT CAACTGGGTA TGGGACGGTT CGCCGGCCAT CCTGCAGTCG
CGGGCCACCG ACGAAACCGG CTACGTGCAG CCGAGCTACG GCCAGTTGCG CAAGGTGCGC
GGCACTAAGT CCATCTATCA CAACAACGCC ATCCAGTCCT GGAAAGTCGT CGAAAGCGGG
GAGGTCACCA ATGTCCAGGT TCTCTAA
 
Protein sequence
MGDITKKPGR LRPAPENFLS EDQVKAVGAG RRDFLRKSFL AAGAAMAAPL AVRASEGDPN 
ILNLPAWSTS LGMPVATNPY GMPSKFESQL VRRESPGLAR VGGASVSFAP LQGLFGIITP
SGLHFERHHQ GWQDVDPTKH RLMINGSDDS LLKKPKVYTM DELMRLPSVS RIHFIECGAN
TGLEWGNVAV PTVQYTHGML SCSEFTGVPL KLLLEDCGVD YKKARYVLAE GADGSSMTRT
IPMEMVESGE VIVAYGQNGE MLRPENGYPL RLVVPGVQGV SWVKWLRRIE VGDMPYATKD
EAVHYIDLLP SGLHRQYSSI QEAKSVITTP SGGQTLVEKG FFNVSGLAWS GRGRIKQVDV
SFDGGLNWQT ARLEGPVQNK ALTRFNINWV WDGSPAILQS RATDETGYVQ PSYGQLRKVR
GTKSIYHNNA IQSWKVVESG EVTNVQVL