Gene Daro_3274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3274 
Symbol 
ID3566100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3524795 
End bp3526480 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content56% 
IMG OID637681746 
Productcytochrome d1, heme region 
Protein accessionYP_286474 
Protein GI71908887 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.825394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA ACGTAGTGGG GATGCTTGCA CTTAGCGCCA TGGCTTTTGC CATGGGTTCT 
GCCTTCGCTC AGGAAACTGC CAAGGCGGCT CCGGAAATGA CGGCTGCCGA AAAAGAACAG
GCCAAGAAGA TTTATTTTGA ACGTTGTGCT GGTTGTCACG GTGTGCTGCG CAAGGGCGCT
ACCGGCAAGA ACCTTGAGCC GCACTGGACG AAGAAGGACA AGGACGGCAA CGTTACCGAA
GGCGGTACGC TGAAGCTTGG GCAGCAGCGT CTGGAAAAAA TTATCGGTTA CGGTACCGAT
GGCGGCATGG TCAACTTCGA CGACATCCTG ACCAAGGAAG AGCTGACCCT GATGGCCAAG
TACATCCAGA ACACGCCGGA TGTGCCACCC GAGTACAGCT TCAAGGATAC GATGGATTCC
TGGAAGGTCA TGGTGCCGGT CGACAAGCGC CCGACCAAGC AGATGAACAA GTACAACCTG
AAGAACATGT TCTCCGTCAC CCTGCGTGAC ACTGGCGAAG TGGCGCTGAT CGATGGCGAT
ACCAAGGAAA TCCGCAGCAT CGTCAAGACT GGTTATGCCG TTCATATCTC CCGTCTTTCT
GCTTCCGGTC GTTACGTTTA TGTGATCGGT CGCGATGGTC GCTTGTCGTT GATCGACCTG
TGGATGGAAC AGCCGGCGGT TGTGGCAGAA GTCAAGATCG GTTTCGATGC CCGCTCGGTC
GATACTTCCA AGTTCAAGGG CTTTGAAGAC AAGTACGCCG TGGCTGGTTC CTACTGGCCG
CCCCAGTACG TGATCATGGA TGGCGATACG CTGAAGCCGC GCAAGGTCGT TTCCACCCGT
GGCATGACCG TTGATGGCGA ATACCATCCG GAACCGCGCG TGGCTTCCAT CGTCGCGTCC
TTCACCAAGC CGGAATGGGT TATCAACATC AAGGAAACCG GCCAGATCCT GTTGGTCGAT
TACTCTGATA TCGAAAATCT GAAGACGACG ACCATCGGTT CCGCCAAGTT CCTGCATGAC
GGTGGCTGGG ATGCTTCCAA GCGCTACTTC CTGGTGGCTG CCAATGCGTC CAACAAGATC
GCGGCGGTTG ACACCAAGAC CGGCAAACTG GCCGCTCTGG TCGATGTCGC CAAGATCCCG
CACCCGGGTC GTGGCGCCAA CTTCACCCAT CCGAAATTTG GTCCGGTATG GACCACCGGT
CACCTCGGTG CCGATGTCCT GACCCTGATC AGCACACCTT CGGATAAGAA GTCGGATGCC
AAGTTCAAGG AATACAACTG GAAGGTTGTT CAGGAAGTGA AGCACGTTCC GGGCAATTTG
TTCGTCAAGA CCCATCCGAA GTCCAAGCAC CTGTGGGCTG ACTCTCCGCA GAATCCGGAA
AAGGAACTGG CTGAATCCGT TGCCATCTGG GATGTGGCTG ACCTGTCGAA GCCGGTCAAG
GTTATCAACG TGGCCAAGGA TTCCGGCCTG CCGGTGACCA AGGCGACTCG CCGTGCCGTT
CATCCGGAAT ACAGCGCTGA TGGCAAGGAA GTCTGGATCT CCCTGTGGGG CGGCAAGACC
GACCAGTCAG CCATCGTGGT TTATGACGAT GCGACGCTGA CCCTGAAGAA GGTGATCACC
GATCCGAAGA TGATCACCCC GACCGGCAAG TTCAACGTCT TCAACACCCA GCACGACATC
TATTGA
 
Protein sequence
MKKNVVGMLA LSAMAFAMGS AFAQETAKAA PEMTAAEKEQ AKKIYFERCA GCHGVLRKGA 
TGKNLEPHWT KKDKDGNVTE GGTLKLGQQR LEKIIGYGTD GGMVNFDDIL TKEELTLMAK
YIQNTPDVPP EYSFKDTMDS WKVMVPVDKR PTKQMNKYNL KNMFSVTLRD TGEVALIDGD
TKEIRSIVKT GYAVHISRLS ASGRYVYVIG RDGRLSLIDL WMEQPAVVAE VKIGFDARSV
DTSKFKGFED KYAVAGSYWP PQYVIMDGDT LKPRKVVSTR GMTVDGEYHP EPRVASIVAS
FTKPEWVINI KETGQILLVD YSDIENLKTT TIGSAKFLHD GGWDASKRYF LVAANASNKI
AAVDTKTGKL AALVDVAKIP HPGRGANFTH PKFGPVWTTG HLGADVLTLI STPSDKKSDA
KFKEYNWKVV QEVKHVPGNL FVKTHPKSKH LWADSPQNPE KELAESVAIW DVADLSKPVK
VINVAKDSGL PVTKATRRAV HPEYSADGKE VWISLWGGKT DQSAIVVYDD ATLTLKKVIT
DPKMITPTGK FNVFNTQHDI Y