Gene Saro_2825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2825 
Symbol 
ID3915464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3051535 
End bp3053154 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content68% 
IMG OID640445604 
Productpeptidase M28 
Protein accessionYP_498095 
Protein GI87200838 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.741116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTCA AGCTGCTCGT TTCCGCCATG CTTCTCGCCT CGACCGCCGT GCCAGCGCTG 
GCGGAGGAAC CGGTTTTCGA TCCGCAGGCA GTGCGGGCCC ACGTCACGTT CCTTGCCGAC
GATCTTCTGG AAGGCCGCGA CACGGGTTCG CGCGGCTACG ACATCGCCGC CAGCTACGTC
GCCTCGCAGT TCGTCGCGAT GGGTCTGAAA CCCGCGGGGC CTGACGGTTC CTTCTACCAG
AAGCTGACCG TGCGCGAGGC GCGGCTCGAT GGCGCGCCGA AGCTGGCGCT GAACTATGGC
GGCAAGGAGA CGGTGCTGTC CGATACGGCC CAGGTGCTGG TGCGGCCGAG CCTGACCGAC
AAGGCGGTCG CGGTGGATGC GCCGCTGGTC TTCGCCGGCT TCGGCTTCGA CCGTCCGGAC
CTCGGCTTCG ACGACTACAA GGGGCTGGAT GCCAAGGGCA AGATCGTGGT CGTGCTCTCG
GGCTTTCCCA AGGGCACGCC GAGCGAGCTT GGCGCGCATC TCAATTCGGA AAAGGCGGTC
ATGGCGATGA AGCGGGGCGC GATCGCGGTC ATTACCGTGC CGACCACCGA GGACAGCGCG
CGCCGCCCGT GGGACAAGCG CGTCGCCATG TCGGACGGCC CGGCCAAGGG CTGGGTCGGC
GCCGACGGCA AGGCCTTTGC CCGCGCGACC GGGATCAGAG GCACGGCGAC GCTCAATCCC
GATGCCGCCG CGCCGCTCTT CGCCGCCTCG GGCAAGCCGC TGGCGAGGGT CCTGGCCGAG
GCGAACCGCA AGGGCGGCCG GCCCAGGGGC TTTGCGTTGA AGGCCAGGGC CAGCCTCTCG
TTCGCCAATG TCTGGAAGGA CGTGACCAGC GAGAACGTGG TCGCGGTCCT GCCGGGAAGC
GACGACAAGC TGGCGGGCGA GTTCGTCGGC ATGACCGCTC ACCTCGACCA CATCGGCATT
CACGGCAAGG GCGAGGACAC GCTGCACAAC GGCGCGATGG ACAACGCCTC GGGCGTGGCG
ACCATGCTGG AGGTGGCCAG GGCGGTTGCG CGAGACCGCC CGCGCCGCTC GGTCATGTTC
GCGGCGCTCA CCGGTGAGGA GGGCGGGCTG ATCGGCTCGG ACTACCTTGC GCGCAATCCG
CTGGTGAAAG GCGAAGTGGT CGGCCTCGTC AACTTCGACA TGCCGGTGCT GACCTACATG
TTCTCCGATG TCGTCGCGTT CGGGGCGGAG AACTCCACGA TGGGGCCGGT GGTGGCAGAG
GCCGCGAAGA AGGCGGGGAT CAAGCTTTCG CCCGACCCCA TGCCGGAAGA GGGCCTGTTC
ACCCGTTCGG ACCACTATCG CTTCGTCCAG CAGGGCGTGC CGGCGGTGTT CATGATGACC
GGCTTCGAAG GCCCCGGCGA GAAGGCGTTC CGCGACTTCC TGAAGACCAA TTACCACCAG
CCGAGCGACG ACTTGAAGTT GCCGTTCAAC TGGGAAGCGG GCGCGCTTTT CGCCAAGGTC
AACTACTACA CCGTGCTGGG TCTCGCGAAC GGCGACGAGC GGCCCCGGTG GTACGCCGGC
AGCTTCTTCG GCAAGGAGTT CGCGTCGGGC GCGGCGAAGG CGGCCGATCC GGCGAAGTAG
 
Protein sequence
MNVKLLVSAM LLASTAVPAL AEEPVFDPQA VRAHVTFLAD DLLEGRDTGS RGYDIAASYV 
ASQFVAMGLK PAGPDGSFYQ KLTVREARLD GAPKLALNYG GKETVLSDTA QVLVRPSLTD
KAVAVDAPLV FAGFGFDRPD LGFDDYKGLD AKGKIVVVLS GFPKGTPSEL GAHLNSEKAV
MAMKRGAIAV ITVPTTEDSA RRPWDKRVAM SDGPAKGWVG ADGKAFARAT GIRGTATLNP
DAAAPLFAAS GKPLARVLAE ANRKGGRPRG FALKARASLS FANVWKDVTS ENVVAVLPGS
DDKLAGEFVG MTAHLDHIGI HGKGEDTLHN GAMDNASGVA TMLEVARAVA RDRPRRSVMF
AALTGEEGGL IGSDYLARNP LVKGEVVGLV NFDMPVLTYM FSDVVAFGAE NSTMGPVVAE
AAKKAGIKLS PDPMPEEGLF TRSDHYRFVQ QGVPAVFMMT GFEGPGEKAF RDFLKTNYHQ
PSDDLKLPFN WEAGALFAKV NYYTVLGLAN GDERPRWYAG SFFGKEFASG AAKAADPAK