Gene Saro_3247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3247 
Symbol 
ID3917505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3467726 
End bp3469084 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content68% 
IMG OID640446031 
ProductO-antigen polymerase 
Protein accessionYP_498516 
Protein GI87201259 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAGA CTCGGCCCAA CCAGCTTGGC GCTCTGCTTC TCGTGGCTTG TCTGTTTGGA 
GGTGGCGGCG TCGCCTATGG CCTCGCAAAC CTTGTCGTAC AGCTCGCGGC TATCCTGCTA
CTTGCACTCC ACCGCGCGGA ACTCGGCAAG TTTCTCGCCC GTTCGCCGCG GATGCTCGCT
GGCCTGGTTG TACTAACCCT GGCTCTCCCG CTGGTGCAGC TATTGCCGCT GCCGCCTGCC
GCATGGACCA CCCTTCCCGG CCGGGATTTC GTCAACGAGG CGCTTGCCGT GGCCGATGGC
GCGACCACAG CGTCCGGATG GTTTCCCTTC ACGGTCAGCA GCGCCCGCAC CCTCGTCGCC
TTTCTTGGCT TGCTCGCACC GTTCGCGGTG ATCGTCCTCG CATGGCGGCT GGACGAGGCC
GCGACCGTCA GGATCATGCA CCTTGTGGTC ATGATCGGGC TCGCCAATGT GCTGTTAGGT
GTCGTTCAGG TCCTCGGCCA GGGCGGTTCC GGGCAACTCT ACATCGAGAA TGAGATGCCG
GGCGTGCTCT TCGGGTTCTT CGCGAACCGC AATTCGACCG GCGTATTCCT TGTCTGCTGC
CTGCTCGTCC TCGCAGCCCT GCCCGCAGCC CGCCCGCTGT CGGGCATCTG GCTGACCAAG
GCAGGCGCAG CGCTGCTCCT CGCCACCGGC GTGTTCCTCA CCCAGTCGCG CACCAGCATG
GTGCTGCTGG GGCTCCCCGC CGCGTTTGCC GTTCTGCGCA TCGGAGCGAT GGCGCTCGAC
CGCCGCGTCG GCGGGAGCGG GCGCAATGCG GCCCGTACGG CTCTCGGCGG CGCGCTTGTC
GCGCTTGCGC TGGGCGCGAC GCTGACCGTT GCCGGTGGCG GATCGCGCAT CGACACCGCG
CTAGCCCGTT TCGAACGATC CGAGGAACAG CGGCCAGCTA TCTGGGAAGA TACCCGCTAC
GCCATCGAGC GATACTGGCC GGTCGGTGCC GGGATGGGCA CGTTCGACGA AGTCTTCCAG
ATCGACGAAT CGCTCGAGAA CATCACGCCG CGCCGCGCCG GGCGCGCACA TAACGACTAC
CTCGAGATCG CGGTCGAGGC CGGGGTCGTC GGCCTCGCAG TGGTCGCGCT CTGGGCGATC
TGGGCCGCGT TCGCCTCATG GCGGGCCGCG TCCACGCCGC AGCGCTGGCC CGCGCTTGCG
GGAACGGGAA TCCTGATGGC CGTCGCTCTC CAGTCGCTGC TCGACTATCC GCTGAGGAAC
CAGGCCATGC TGTGCATCGC CGCGCTTGCG GTCGCGCTTC TCACACGCGC GGGGCGCAGC
GACGCGTCAG GGCACGTCGC CGGAGGTGCC GGCCGATGA
 
Protein sequence
MFKTRPNQLG ALLLVACLFG GGGVAYGLAN LVVQLAAILL LALHRAELGK FLARSPRMLA 
GLVVLTLALP LVQLLPLPPA AWTTLPGRDF VNEALAVADG ATTASGWFPF TVSSARTLVA
FLGLLAPFAV IVLAWRLDEA ATVRIMHLVV MIGLANVLLG VVQVLGQGGS GQLYIENEMP
GVLFGFFANR NSTGVFLVCC LLVLAALPAA RPLSGIWLTK AGAALLLATG VFLTQSRTSM
VLLGLPAAFA VLRIGAMALD RRVGGSGRNA ARTALGGALV ALALGATLTV AGGGSRIDTA
LARFERSEEQ RPAIWEDTRY AIERYWPVGA GMGTFDEVFQ IDESLENITP RRAGRAHNDY
LEIAVEAGVV GLAVVALWAI WAAFASWRAA STPQRWPALA GTGILMAVAL QSLLDYPLRN
QAMLCIAALA VALLTRAGRS DASGHVAGGA GR