Gene Saro_0944 details

Gene Information

Locus tag: Saro_0944
Symbol:
ID: 3918030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 991721
End bp: 993067
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 67%
IMG OID: 640443678
Product: Pyrrolo-quinoline quinone
Protein accession: YP_496223
Protein GI: 87198966
COG category: [S] Function unknown
COG ID: [COG1520] FOG: WD40-like repeat
TIGRFAM ID:
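
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 991721
END_BP = 993067


def gene_length(start: int, end: int) -> int:
    """Length of a closed (inclusive), 1-based coordinate interval."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


cds = gene_length(START_BP, END_BP)
print(cds)                  # 1347
print(protein_length(cds))  # 448
```

Both values agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields.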


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCCA ACCCCAATAT TCGCCGCGCT TCGCTCGCCA GCGTGCTGGT CCTTGCGCTC 
GCGCTGGGTG GCTGCGGCAT TTTCGGCGGC AAGGACAAGG CCAAGACGAC CCCGACGCTG
GGCCAGCGCG TGCCGATCCT GTCGAAGATC GAGGCCGGCA CAAAGGTCGA CGATTCGATC
TCGCTGACCA CGGTCGTCCT GCCCGCGCCC GAAGTGAACG CCGATTTCGC GCAGGGCGGC
GGCAATGCGA GCAAGTCCTA CGGCCACCTC GCGCTCGGCG ATGCGCCGCG CAAGGCCTGG
ACGGTCGGGA TCGCCGGGTC CTCCTCGAAG CAGCGTCTGG CCGCGTCGCC GGTCGTCGGC
GGGGGCAAGC TCTACGTGAT GGACACGGAC GGCACCGTCC ATGCCTTCGA CGCGGCGAGC
GGGAAGTCCG TGTGGGAGAC CCCGGTCAAG GCCGAGAAGC AGAACGCCAA CTCCACCTTC
GGCGGCGGCG CGTCCTATGA CGACGGCGTG GTCTATGTGA CCAATGGCGT TGGCGAAGTC
GCTGCGCTTG ATGCCGCTAA CGGAGCGGTA AAGTGGCGCG TCAAGCCCGC CGGCCCGCTG
CGCGGATCGC CCACGGTCGC CTTCGGGCAG GTCATGGCGA TGACCCAGGA CAACCAGATC
GTCACCCTGA ACGCCGCCGA TGGCGTGGTC CTGTGGAACG AGAACGCCTC GGTCGGACAG
ACCAACGTGT TCGGCGTCGC CTCGCCCGCG GCAGGGCAGG GCACGATCGT GGCCGGTTAT
TCCTCGGGCG AACTGGTCGC CTACCGCTAC GAGAACGGGC GCCAGCTCTG GGCCGACGCC
CTTGCGCGCA CCAGCATCGC GACCAGCGTC TCGACGCTGA CCGACATCGA CGCCGATCCG
ATCATCGAGC GCGGCCGCGT CTTCGCGCTG GGGCAGGGCG GGCGCATGGC CGCCTACGAA
CTCGTGACCG GCCAGCGCGT GTGGGAACTC AATCTCGCGG GCATCTCCAC CCCCGCCATC
GCCGGTGACT GGATCTTCAC GCTGACCGAC GAGGCCAAGC TGCTGTGCAT CGCCAAGTCC
AACGGCAAGG TCCGCTGGAT GACGCAGCTT CCGCGTTATC GGAACGAGAA GAAGAAGAAG
AACCAGATCC TGTGGACCGG CCCGGTCCTT GCCGGCAACC GCCTGTGGAT CGCCAATTCG
CGCGGCGAAG TGATGCACGC ATCCGTCACC GACGGCACCG TCAGCGAATT CACCAAGCTC
GGCGCGGCGG TAAGCCTTGC CCCCGTGGTC GCGAACCAGA CGCTCTACAT CCTCGACGAC
AACGGCAAGA TCACCGCGTT CCGCTGA
 
Protein sequence
MTPNPNIRRA SLASVLVLAL ALGGCGIFGG KDKAKTTPTL GQRVPILSKI EAGTKVDDSI 
SLTTVVLPAP EVNADFAQGG GNASKSYGHL ALGDAPRKAW TVGIAGSSSK QRLAASPVVG
GGKLYVMDTD GTVHAFDAAS GKSVWETPVK AEKQNANSTF GGGASYDDGV VYVTNGVGEV
AALDAANGAV KWRVKPAGPL RGSPTVAFGQ VMAMTQDNQI VTLNAADGVV LWNENASVGQ
TNVFGVASPA AGQGTIVAGY SSGELVAYRY ENGRQLWADA LARTSIATSV STLTDIDADP
IIERGRVFAL GQGGRMAAYE LVTGQRVWEL NLAGISTPAI AGDWIFTLTD EAKLLCIAKS
NGKVRWMTQL PRYRNEKKKK NQILWTGPVL AGNRLWIANS RGEVMHASVT DGTVSEFTKL
GAAVSLAPVV ANQTLYILDD NGKITAFR
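
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons (the tables differ mainly in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 nt, 20 codons).
FIRST_LINE = "ATGACCCCCA ACCCCAATAT TCGCCGCGCT TCGCTCGCCA GCGTGCTGGT CCTTGCGCTC"
print(translate(FIRST_LINE))  # MTPNPNIRRASLASVLVLAL
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared against the GC content field.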