Gene Daro_3323 details

Gene Information

Locus tag: Daro_3323
Symbol:
ID: 3566307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3572257
End bp: 3573987
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 58%
IMG OID: 637681795
Product: cytochrome c, class I:cytochrome d1, heme region
Protein accession: YP_286522
Protein GI: 71908935
COG category: [C] Energy production and conversion
COG ID: [COG2010] Cytochrome c, mono- and diheme variants
TIGRFAM ID:
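
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: the variable names are not part of the record, and it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 3572257
end_bp = 3573987

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed gene length (1731 bp) and protein length (576 aa), and the gene length is a whole number of codons.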


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000018049
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTTCAA GAAAATTCAA ACGTAGCGGC CTTGCGGCAT TGCCGCTCCT GGCGATGCTG 
GTCCAGTTCG CCTTTGCCGG GACGCCGTCC GAACACGGCG ATGGCGCGAT GCGCGACTAC
CAGGCGGCTG GCGGTTCGCC GCTGGCTGAC GTGAAGATGC ACCAGGATAT CAACCCGCTG
GCGCCCAAGA TGTCGGAAAC CGAGTTCGAC AAGGCCAAGC GTATCTTCTT CGAACGCTGT
GCCGGCTGTC ACGGCGTGCT GCGCAAGGGG GCGACCGGCA AGCCGCTGAC TCCCGACATC
ACCCTGGGAA AGGGGCTGGA ATACCTGAAG GTCTTTATCA AGTACGGTTC GGCCGGCGGC
ATGCCGAACT GGGGCACCTC CGGCATCCTG ACCGACGATG AAGTCGACCT GATGGCCCGC
TATATCCAGC AGACGCCACC GGCCCCGCCA GAATTCGGCC TGAAGGATAT GGAAGCCTCG
TGGAAGGTCA TCGTCCCGGT CGATCAGCGG CCAAAGCGGA AAATGAACAA CATCAACCTG
GAGAACATGT TCTCGGTCAC CCTGCGCGAT GCCGGCGAAA TCGCGCTGAT CGACGGTGAC
AGCAAGAAGA TCATCAGTAT CCTGAACACC GGCTATGCCG TTCATATCTC GCGTATGTCG
GCTTCCGGTC GCTACATGTT CGTGATCGGC CGCGATGCCA AGATCAACCT GATCGACCTG
TGGATGGAAA AGCCGGATAC CGTGGCTGAA ATCAAGGTCG GCATGGAAGC ACGCTCGGTT
GAAAGCTCCA AGGCCAAGGG CTTTGAAGAC AAGTATGCGA TTGCCGGTAC CTACTGGCCG
CCGCAGTTCG TGATCATGGA TGGCGACACG CTCAAGCCGC GCAAGATCGT TGCCACCCGC
GGCATGACCG TCGGCACGCA GGACTACCAT CCCGAGCCGC GTGTCGCTGC CATTGTTTCC
TCGCACTTCA ATCCGGAATT CTTCGTTAAC GTGAAGGAAA CCGGCATGGT CTATTCGGTC
GACTACCGCG ACCTGAACAA CCTGAAGATC AAGATGATCG AAGCCGCTCC CTTCCTGCAT
GATGGTGGTT TCGAGTCCAC GCATCGCTAC TTCATGGATG CTGCCAACGC CTCCAACAAG
ATCGCGGTCA TCGATACCAA GGAAGGCAAG CTGGAGAAAC TGGTTCCGGT CGGCAAGACG
CCGCACCCGG GCCGTGGCGC CAACTTCATC GATCCCAAGT TCGGGCCGGT ATGGGCAACG
GGCCACCTCG GCGACGAGAG CATCACGCTG ATCGGCACCG ATCCGAAAAA GCATCCGGAC
AATGCCTGGA AGGTTGTCCG CACGCTGAAG GGGCTGGGCG GTGGTTCGTT GTTCCTGAAA
ACGCATCCGA AGTCGAAGAA CCTGTGGGTC GATACCACGC TCAATCCGGA AGCCGGTATC
AGCCAGTCAG TGGCCGTTTG GGATATCAAC AACCTGGAAA AGGGTAGTGA GTTGATCCCG
ATCGGCGAAT GGTCGGGCAT CAAGGATGGT CCGAAGCGCG TTGTCCAGCC GGAGTACAAC
AAGGCCGGTG ACGAAGTCTG GTTCTCCGTC TGGAACGGCA AGGATCAGGA ATCTGCCATC
GTGGTGGTCG ATGACAAAAC GCGCAAGCTG AAGGCCGTCA TCCGTGATCC GAAGCTGGTC
ACGCCAACCG GCAAGTTCAA CGTTTTCAAT ACGCAGCACG ACGTCTACTA A
 
Protein sequence
MISRKFKRSG LAALPLLAML VQFAFAGTPS EHGDGAMRDY QAAGGSPLAD VKMHQDINPL 
APKMSETEFD KAKRIFFERC AGCHGVLRKG ATGKPLTPDI TLGKGLEYLK VFIKYGSAGG
MPNWGTSGIL TDDEVDLMAR YIQQTPPAPP EFGLKDMEAS WKVIVPVDQR PKRKMNNINL
ENMFSVTLRD AGEIALIDGD SKKIISILNT GYAVHISRMS ASGRYMFVIG RDAKINLIDL
WMEKPDTVAE IKVGMEARSV ESSKAKGFED KYAIAGTYWP PQFVIMDGDT LKPRKIVATR
GMTVGTQDYH PEPRVAAIVS SHFNPEFFVN VKETGMVYSV DYRDLNNLKI KMIEAAPFLH
DGGFESTHRY FMDAANASNK IAVIDTKEGK LEKLVPVGKT PHPGRGANFI DPKFGPVWAT
GHLGDESITL IGTDPKKHPD NAWKVVRTLK GLGGGSLFLK THPKSKNLWV DTTLNPEAGI
SQSVAVWDIN NLEKGSELIP IGEWSGIKDG PKRVVQPEYN KAGDEVWFSV WNGKDQESAI
VVVDDKTRKL KAVIRDPKLV TPTGKFNVFN TQHDVY
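
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons it permits.

```python
# Standard codon table in NCBI ordering (bases T, C, A, G).
# Assumption: translation table 11 shares these codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate DNA codon by codon from position 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGATTTCAA GAAAATTCAA ACGTAGCGGC"
print(translate(first_blocks))  # MISRKFKRSG
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field of the record.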