Gene Daro_1054 details


Gene Information

Locus tag: Daro_1054
Symbol:
ID: 3568217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1156049
End bp: 1157095
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 61%
IMG OID: 637679516
Product: selenophosphate synthetase
Protein accession: YP_284280
Protein GI: 71906693
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0709] Selenophosphate synthase
TIGRFAM ID: [TIGR00476] selenium donor protein
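
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1156049
end_bp = 1157095

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a complete CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1047 0 348
```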


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGGAAG AAAAAATCAG ACTTACCCAG CTTTCCCATG GTGGTGGCTG TGGTTGCAAG 
ATCGCGCCTG CCGTATTGCA GAAAATTCTG GCCGGCACCA CGGGCAGCAT CATTCCGCCG
CAGCTACTGG TCGGTACCGA GACCAGCGAC GACGCTGCGG TCTACCAGAT CAACGCGCAG
CAGGCGATTG TCGCGACGAC CGACTTTTTC ATGCCGATCG TCGACAATCC TCGCGATTTC
GGGCGCATCG CGGCGACCAA TGCCATTTCG GATGTTTATG CCATGGGCGG GACGCCGTTG
TTCGCGCTGG CGCTAGTCGG CATGCCGGTC AATGTCCTGC CGCTGGAAAC CATCGGCCAG
ATTCTGCAAG GCGGTGAGGA CGTCTGCCGG GCAGCCGGCA TTCCCATTGC CGGCGGCCAT
ACGATCGATT CGGTTGAGCC CATCTATGGC CTGGTGGCCA TCGGCTTGGT CAACCCGGAA
CATTTGAAGC GCAATTCCGG CGCCAAATCC GGGGACAAGC TGATCCTTGG CAAGCAACTC
GGTGTGGGGA TCTACAGCGC GGCGCTGAAA AAGGATCAAC TCCAGGCCAA GGATTACGAA
GCCATGGTCG AGACCACAAC CCAGCTCAAT ACGCCGGGGC CGGTATTGGC CTGTCTGGAT
GGTGTTCATG CCGTGACCGA CGTCACCGGC TTCGGGCTGG CCGGTCATCT GCTGGAAGTC
TGCAAGGGCA GCGGCCTGCG GGCGACAGTG AATTACCAGG ATTTGCCGGT ATTGCCCAAA
GCTCGCGAGT TCATGCAGGC CGGACTGATG ACCGGCGCTT CGGGACGCAA CTGGGCGAGC
TACGGCGAAG GTGTGCGTAT CGCCGACGGC CTCGAAGGCA TCGCGCAGAC CTTGCTGACT
GACCCCCAGA CATCCGGTGG TTTGCTGGTT TCATGCTCGC CGGAAACGGT GACGGAAGTG
CTCTCCTTGT TCCTGCAGCA CGGCTTCCCC CACGTTTCGG TGATCGGCGA AATGGCCGAA
GGCGAACCGG GCATCGACGT CATTTAA
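
A GC content figure such as the one in the record can be recomputed from a gene sequence in this layout. The sketch below is one possible way to do it, assuming GC content means the fraction of G and C among the A/C/G/T bases; `gc_content` is a hypothetical helper, and the spaces and line breaks used to group the bases are ignored. It is shown on the first 10-base block only.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C among the A/C/G/T bases.

    Whitespace and any other characters are ignored, so the
    10-base blocks of the record can be pasted in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence: 5 G + 2 C out of 10 bases.
first_block = "GTGCCGGAAG"
print(f"{gc_content(first_block):.0%}")  # prints: 70%
```

Passing the full gene sequence to the same function would give the overall value for the gene.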
 
Protein sequence
MPEEKIRLTQ LSHGGGCGCK IAPAVLQKIL AGTTGSIIPP QLLVGTETSD DAAVYQINAQ 
QAIVATTDFF MPIVDNPRDF GRIAATNAIS DVYAMGGTPL FALALVGMPV NVLPLETIGQ
ILQGGEDVCR AAGIPIAGGH TIDSVEPIYG LVAIGLVNPE HLKRNSGAKS GDKLILGKQL
GVGIYSAALK KDQLQAKDYE AMVETTTQLN TPGPVLACLD GVHAVTDVTG FGLAGHLLEV
CKGSGLRATV NYQDLPVLPK AREFMQAGLM TGASGRNWAS YGEGVRIADG LEGIAQTLLT
DPQTSGGLLV SCSPETVTEV LSLFLQHGFP HVSVIGEMAE GEPGIDVI
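
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as a start codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a minimal illustration, not the database's own method: `translate_cds` is a hypothetical helper, and `START_CODONS` is an assumed subset of the start codons that table 11 allows.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGCCGGAAG AAAAAATCAG ACTTACCCAG"))  # prints: MPEEKIRLTQ
```

The result matches the first ten residues of the protein sequence.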