Gene Saro_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0844 
Symbol 
ID3915899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp893966 
End bp895585 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content65% 
IMG OID640443576 
Productpeptidase M28 
Protein accessionYP_496123 
Protein GI87198866 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCTT GCCTGCGCGG GGCCGCGAGC GCCGCCGCCA TTGCCCTTTG CACGTTCGCC 
GCCCCGGCCA TTGCCGGACC CGCCGAGGAC AAGCTGATCG CGCAGCTCCT GGACGAGGGA
TTGAACCGTT CCGATGCGAT GGAGATCGCG TCGGAGCTGA TGGACCGCAT CGGGCCGCGT
CTCACCAATT CCGAAAATCA TCGCAAGGCG GAGGACTGGG CTGCAGCCAA GTTCGCCTCG
TTCGGGCTCA AGAACATCCA CCGCGAACCG TTCGAGTTCG GGCTTGGCTG GAACCTCAAG
TCCTATTCGG CGACGATGAC CTCGCCGCGC AGCCTGCCCC TCACCGTCCT GCCGGTCGCA
TGGTCGCCGC CCACCGGCGG CACGATCACC GCCCCGGTCA TCGTCGCGCC GATGACCAAG
GTCGAGAATT TCGATGCGTG GAAGGGCAAG CTGGCCGGCC GGATCGTGCT GGTCAGCCTG
CCTGGCGAGA CCAGCGCGTC CGCAGATCCG GTGTTCGAGC GCCTCTCGGG CGAAGAGATC
GGCAAGCTCG ACAAGTATAC CTTGCCGCGC CACGACCCCG AAGGTCTGGC CATGCAGGTC
GCCCGGCGTG GATTTGCCCG CAAGCTGTCG GAGTTCCTCA AGGCGGAAGG CGCGTTGGCC
ATGGTCCGCA TGACCTATCG CGATGGCAAG CTGGTCCATG GCGAGGGCTA TGACTTCGTG
CCGGGCCAGA CCCTTGCCGT GCCGGCGATG GACATGGCGC AGGAAGACTA TCGCCGCCTC
GTCCGCCTGG AGAAGACGGG TGCCGCCCCG CAGCTCTCGC TCAGCATCGA CGCGAGCTTC
GACGACAAGG ATCTGATGGC GGACAACGTC ATTGCCGAGA TCCCCGGCAG CGATCCCAAG
GCGGGCTACG TCATGGCGGG TGCGCACTTC GACAGCTGGA TCGCGGGCGA CGGCGCATCC
GACAATGGAG CGGGAAGCGT CGCCGTGATC GAGGCTGCAC GCCTGCTCTC GAAAATGGGC
GTCAAGCCGA AGCGCACCAT CCGCTTTGCG CTGTGGAGCG GGGAGGAGCA GGGGCTGCTG
GGCTCGAAGG CCTATATCGA GCAGCATCTC GCCACCCGGC CGGTCGACCC GGCGCTAAAG
GGCATCGACA GCTATTCGGC ATGGCGCAAT GCCTATCCGA TCACGCCCAA GCCGGGCTAT
TCCCAGCTCA AGGCCTATTT CAACATGGAC AACGGTTCGG GCAAGTTCCG CGGCATTTAT
GCCGAGGGCA ACGTTGCCGC CGCGCCGATC CTCAGGGAAT GGCTCGCGCC CTTCAGCTCG
CTCGGCGCGG ACAAGGTGGT GATGAGCAAG ACGGGCGGGA CCGACCACGT CTATCTCCAG
GCAATCGGCC TGCCGGGCTA CCAGTTCATC CAGGACCCGC TCGATTACGA GAGCCGCGTG
CACCACTCCA GCCTCGACAC GCTTGACCAC ATGCGCGCCG ACGACATGAG GCAGGCCTCC
GTCATTCTGG CGGGAATGCT GCTCCAGGCG GCGACAAGCG AGAAGGAACT GCCGCGCTCG
CCGTTGCCGA CCAAGCCGGA CGCGACCGAT CCGTTCAAGG TGCAGGACCC CAACCAGTAG
 
Protein sequence
MTSCLRGAAS AAAIALCTFA APAIAGPAED KLIAQLLDEG LNRSDAMEIA SELMDRIGPR 
LTNSENHRKA EDWAAAKFAS FGLKNIHREP FEFGLGWNLK SYSATMTSPR SLPLTVLPVA
WSPPTGGTIT APVIVAPMTK VENFDAWKGK LAGRIVLVSL PGETSASADP VFERLSGEEI
GKLDKYTLPR HDPEGLAMQV ARRGFARKLS EFLKAEGALA MVRMTYRDGK LVHGEGYDFV
PGQTLAVPAM DMAQEDYRRL VRLEKTGAAP QLSLSIDASF DDKDLMADNV IAEIPGSDPK
AGYVMAGAHF DSWIAGDGAS DNGAGSVAVI EAARLLSKMG VKPKRTIRFA LWSGEEQGLL
GSKAYIEQHL ATRPVDPALK GIDSYSAWRN AYPITPKPGY SQLKAYFNMD NGSGKFRGIY
AEGNVAAAPI LREWLAPFSS LGADKVVMSK TGGTDHVYLQ AIGLPGYQFI QDPLDYESRV
HHSSLDTLDH MRADDMRQAS VILAGMLLQA ATSEKELPRS PLPTKPDATD PFKVQDPNQ