Gene Saro_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3209 
Symbol 
ID3917467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3426482 
End bp3427507 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content66% 
IMG OID640445993 
ProductArsR family transcriptional regulator 
Protein accessionYP_498478 
Protein GI87201221 
COG category[H] Coenzyme transport and metabolism
[K] Transcription 
COG ID[COG0640] Predicted transcriptional regulators
[COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAACT CGATGACCAT CCTAGACGCC ATCCGTGCGC TCGATGACCC GACGCGCCTG 
CGCATCATGC GCTTGCTCGC CAGCATGGAA CTGGCGGTGG GCGAGGTCGC GCAAGTATTG
GGACAGAGCC AGCCGCGCGT CTCGCGTCAC ATCAAGATCC TTTGCGATTC GGGCCTTGCC
GAACGTCGGA AGGAAGGCGC CTGGGTGTTT CTGCGCAGTT CCATCGGCGA AGGCGCGGAA
AGCCCGCTCG CCTCTGCGCT GGCGCGCCTG CTGGCCGTCG CGGAGCACGA AGACACGGCT
TTTGGCCGCC GTTGCTCCGA AGACCGCCAG CATCTCGACG CTATCCGTTC GTCGCGGGAG
AGCCACGCGC TCGAATGGTT CGCCCGCCAT GCCGACGAGT GGGACGAATT GCGCTCCCTT
CATATCGCCG ATGGTCCGGT CGAGGCCGCG CTTACCGAGA TGCTCCTGGC GCTTTCAGGC
GACGGTTCGC TCGGTCGCCT GCTCGACGTC GGCACCGGTA CCGGCCGGAT CGCCGAACTC
TTTGCGCCCA ATGCCGCCCA TGTCGTCGCC TTCGACAAGA GCCCGGACAT GCTGCGCATC
GCGCGCGCGC GCCTCCAGCA TTTGCCAGCC GACGCGGTGG AACTGGTCCA GGGCGATTTC
GCGCAACTTC CCTTCGCCGC GCGCAGCTTC GATACCGTAC TGTTTCATCA GGTTCTGCAC
TACGCCCAGG CACCGGAAGC AGTGCTCGCC GGCGCGGCTC GCGTTACCGC ACCCGGTGGC
CGCGTCGCCA TCGTCGACTT CGCCGCGCAC GAGCGCGAGG ACCTGCGCCA GACCCATGCC
CACGCCCGCC TCGGCTTCTC CGACGCGCAG ATCGAGACGA TGCTGCTCGA TGCCGGTTTC
ATTCCGCACG AGACCCGCGC GCTCGCCGGC CATGAACTCG TCGTCAAGCT GTGGACCGCA
GTCCGCCGCG AAGACAGCGT CACCCAGCTT GAACCCCGTC AGAAATCCAG CTCCGGAAAG
ACCTGA
 
Protein sequence
MKNSMTILDA IRALDDPTRL RIMRLLASME LAVGEVAQVL GQSQPRVSRH IKILCDSGLA 
ERRKEGAWVF LRSSIGEGAE SPLASALARL LAVAEHEDTA FGRRCSEDRQ HLDAIRSSRE
SHALEWFARH ADEWDELRSL HIADGPVEAA LTEMLLALSG DGSLGRLLDV GTGTGRIAEL
FAPNAAHVVA FDKSPDMLRI ARARLQHLPA DAVELVQGDF AQLPFAARSF DTVLFHQVLH
YAQAPEAVLA GAARVTAPGG RVAIVDFAAH EREDLRQTHA HARLGFSDAQ IETMLLDAGF
IPHETRALAG HELVVKLWTA VRREDSVTQL EPRQKSSSGK T