Gene Daro_1389 details

Gene Information

Locus tag: Daro_1389
Symbol:
ID: 3566117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 1519188
End bp: 1520117
Gene Length: 930 bp
Protein Length: 309 aa
Translation table: 11
GC content: 64%
IMG OID: 637679857
Product: hypothetical protein
Protein accession: YP_284608
Protein GI: 71907021
COG category: [P] Inorganic ion transport and metabolism; [S] Function unknown
COG ID: [COG0586] Uncharacterized membrane-associated protein; [COG0607] Rhodanese-related sulfurtransferase
TIGRFAM ID:
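
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 1519188
end_bp = 1520117

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 930 bp
print(protein_length, "aa")  # 309 aa
```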


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0000000831518
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.286755
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAATTT CCCAGCTTGG CGAAGCGCTC CAGCGTGATG CAGTCTGGGT CGTTTTCCTG 
AACGTCCTGC TGCAACAGAT CGGGCTACCG GTGCCAGCCG TTCCCACCCT GCTCCTGGCC
GGCAGCCTGA TCGTCGGCTC CTGGCAGTTT GCCAGCGTCT TGGCCGCCGC CATCGTCGCG
TCGGTGCTGG CCGACTGGGT CTGGTACCTG GCTGGGCGCG CCTTTGGCTA TCGCGTGCTG
GCCGGCCTAT GCAAACTGTC GATCAATCCC GGCTCCTGCG TCAGCCAGAC CGAGGCCCGC
TTTGTCCGCT GGGGGCTTGG ATCGCTGGTC TTTGCCAAGT TTGTTCCGGG TTTTTCAACG
GTGGCACCAC CCATTGCCGG CTCATTGCGC ATGGGCTTGC CGGGTTTCCT GCTTGCCGCC
GCCACCGGGG CTGCCCTGTG GGCCGGGCTT GGCCTGGGCA CGGGCTGGCT TTTGCGTAAA
GAGGTGCATC GCGCCATCGA GGCGCTGGAC CAGAACTCCG GAAGCCTGCT CGGGCTGATC
GCCGGCACGA TCGCGCTGTG GCTGGGCTGG AAGCTATGGC AGAAATATCG CTTCCGGCAA
TTGTCGGCTG TCCCTCACAT CACGCCGGTT GAGCTTATGG CCGCAATGGA AACCGACCAG
CCCCCGCTGG TGCTCGATCT GCGCGGGCAC AGCATGGTGG CCGAAACCGG CCCGATCACC
GGTGCAACAG TGGCCGAACA TGACAGGCTG CTCGATGCCG TGGGCGAATG GCCCAAAAAC
CTGCCTATCG TGACCTTGTG CGCCTGCCCG GAGGACGCCG GGGCGATACA GGCAGCCCGC
CAATTGCTCA ACGCAGGCTT CCTGTCGGTA CGGCCACTCA AGGGGGGATA CGAAGCTTGG
CTAGCGACCG CCAATGGGAA CAACGTCTGA
 
Protein sequence
MEISQLGEAL QRDAVWVVFL NVLLQQIGLP VPAVPTLLLA GSLIVGSWQF ASVLAAAIVA 
SVLADWVWYL AGRAFGYRVL AGLCKLSINP GSCVSQTEAR FVRWGLGSLV FAKFVPGFST
VAPPIAGSLR MGLPGFLLAA ATGAALWAGL GLGTGWLLRK EVHRAIEALD QNSGSLLGLI
AGTIALWLGW KLWQKYRFRQ LSAVPHITPV ELMAAMETDQ PPLVLDLRGH SMVAETGPIT
GATVAEHDRL LDAVGEWPKN LPIVTLCACP EDAGAIQAAR QLLNAGFLSV RPLKGGYEAW
LATANGNNV
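
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a rough sketch in plain Python, not the tool that produced this record. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. It uses the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons.

```python
# Build the codon table from the conventional TCAG ordering.
# Amino acid assignments for translation table 11 are the same as
# the standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGAAATTT CCCAGCTTGG CGAAGCGCTC"))  # MEISQLGEAL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_content` and comparing with the GC content field, gives a quick consistency check on the record.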