Gene Saro_1751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1751 
Symbol 
ID3916326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1845056 
End bp1846600 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content69% 
IMG OID640444492 
Productpeptidase M28 
Protein accessionYP_497025 
Protein GI87199768 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.691853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAG CCGGGTGCGG CAACAGGGGC GGCTTGCGAA AAACGATGAC AGCGGCGCTT 
GTCATCGCGG CGCTGGCGCT CGTCGCCGCA AGGCCCGCGC CGAGGAAGCC CACGCCGCTC
GAGAACACGC TGAAGGTCCA CGTCGAGACG CTGGCAAGCG ACGATTTCGA CGGGCGCGAA
CCTGGCACCG AGGGCGAAAC CAAGACGTTG CGCTATCTGG CGCGGCAGTG GTTCGACATC
GGCATGGAGG CGGGCACGAA CATCCCCGGC AGCGGATGGT TCGCTCCGGT GGAACTGGTC
GAGCGCGAAC CGCTGGTGTC GCGGGCCAGC TTCGTGCGCG GGCGCAAGCG GGTCAGCCTG
CCGGCCGACA GCGTTTTCGT GGTGACTTCC GGCCTGCGCA GCCTTGTCCA GGACGCGCCC
CTGCTGTTCG TCGGCCATGC AACGTCGCCT GGCGTTCCGC GGGCTGAGCT GGCGGGCCGG
GTGGCGGTCA TGCTGGACAG CCAGGCAGCC GTTGGCGATC CTCAGCGGAG CAGCGACCGG
GCCGGAAAGC TGCTGGATGC GGGCGCACTG GCGGTCGTCA CGATCCTCGA TGGCGAACGC
GGGATCGAAG ACGTGATCGC GCGGCGGCGG CGAGCGGGCT ACGCGATCGC CGGCGAGACG
CTGGGCAACG AGATCGAGGC CTTCCTTTCG CCCGCCGCCG CGAACCTGCT GCTGGCCACT
TCCGCCCATC CCAATGTGGC GCGCCTGCGT GCCGATGCCG ACGCACCGGG CTTTGCGCCC
TACCTGCTAG GCATAACCGC GACGTTCGAG GCGACGAGCC GCGAGACGCG GATCAAGACC
CACAACCTCA TCGGCCGGCT CCCGGGGCGC AACCCTGCGG CCGGAGCGGT GCTGATGCTG
GCGCACTGGG ACCACTTCGG CGAATGCGCC GCGCCGCCCG CGGAGGACCT CATCTGCAAC
GGCGCGATCG ACAACGCCTC CGGCCTGGCG GTGATGACCG AGACCGCGCG GCTGCTGTCG
CGCGGGAGAC CGATGGAGCG TGACGTCTAT TTCCTCGCGA CCACGGGCGA GGAACTCGGC
CTGCTTGGCG CCATGGCCTT TGCCGAGGAC CCGCCGATCC CGCTCGACCG TATCGTCGCG
GCCTTCAACG TCGACAGCAC CGGGCTGGTG CCGGCGGGGG CGCCAGTGGG AATCGTGGGC
AGGGGCATGA CCCCGCTCGA CCCGCTGATC GCGGACGTGG TGCGGCGGAT GAAGCGCAAG
CTCGCGCCCG GCGACGGGGC GAACGCCTAT ATCCGCAGGC AGGATAGCTG GATCCTGATG
CAGCACGACG TGCCGACGGT CATGGTCAGC AGCAGCTATG GCAACATGGA ACGCCTCGAA
CGGTTCATGG AAGACACCTA TCACCGCCCG ACCGACCAGG CCGGCGCGGG AATCGACTAT
GGCGGCATGG CCGACGACGT GCTGCTCCAG GCGGAACTGG TCCGCGCATT CGCCGATCCG
AAGCGCTATC CGGGGGCCGG ATCGAAGCGG GCCCAAACCC CCTAG
 
Protein sequence
MKRAGCGNRG GLRKTMTAAL VIAALALVAA RPAPRKPTPL ENTLKVHVET LASDDFDGRE 
PGTEGETKTL RYLARQWFDI GMEAGTNIPG SGWFAPVELV EREPLVSRAS FVRGRKRVSL
PADSVFVVTS GLRSLVQDAP LLFVGHATSP GVPRAELAGR VAVMLDSQAA VGDPQRSSDR
AGKLLDAGAL AVVTILDGER GIEDVIARRR RAGYAIAGET LGNEIEAFLS PAAANLLLAT
SAHPNVARLR ADADAPGFAP YLLGITATFE ATSRETRIKT HNLIGRLPGR NPAAGAVLML
AHWDHFGECA APPAEDLICN GAIDNASGLA VMTETARLLS RGRPMERDVY FLATTGEELG
LLGAMAFAED PPIPLDRIVA AFNVDSTGLV PAGAPVGIVG RGMTPLDPLI ADVVRRMKRK
LAPGDGANAY IRRQDSWILM QHDVPTVMVS SSYGNMERLE RFMEDTYHRP TDQAGAGIDY
GGMADDVLLQ AELVRAFADP KRYPGAGSKR AQTP