Gene Information |
Locus tag | Saro_0844 |
Symbol | |
ID | 3915899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 893966 |
End bp | 895585 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443576 |
Product | peptidase M28 |
Protein accession | YP_496123 |
Protein GI | 87198866 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
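The coordinate and length fields above agree with one another, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the protein length excludes the stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 893966
end_bp = 895585
reported_gene_length = 1620     # bp
reported_protein_length = 539   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # 1620 540 539
```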
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACATCTT GCCTGCGCGG GGCCGCGAGC GCCGCCGCCA TTGCCCTTTG CACGTTCGCC GCCCCGGCCA TTGCCGGACC CGCCGAGGAC AAGCTGATCG CGCAGCTCCT GGACGAGGGA TTGAACCGTT CCGATGCGAT GGAGATCGCG TCGGAGCTGA TGGACCGCAT CGGGCCGCGT CTCACCAATT CCGAAAATCA TCGCAAGGCG GAGGACTGGG CTGCAGCCAA GTTCGCCTCG TTCGGGCTCA AGAACATCCA CCGCGAACCG TTCGAGTTCG GGCTTGGCTG GAACCTCAAG TCCTATTCGG CGACGATGAC CTCGCCGCGC AGCCTGCCCC TCACCGTCCT GCCGGTCGCA TGGTCGCCGC CCACCGGCGG CACGATCACC GCCCCGGTCA TCGTCGCGCC GATGACCAAG GTCGAGAATT TCGATGCGTG GAAGGGCAAG CTGGCCGGCC GGATCGTGCT GGTCAGCCTG CCTGGCGAGA CCAGCGCGTC CGCAGATCCG GTGTTCGAGC GCCTCTCGGG CGAAGAGATC GGCAAGCTCG ACAAGTATAC CTTGCCGCGC CACGACCCCG AAGGTCTGGC CATGCAGGTC GCCCGGCGTG GATTTGCCCG CAAGCTGTCG GAGTTCCTCA AGGCGGAAGG CGCGTTGGCC ATGGTCCGCA TGACCTATCG CGATGGCAAG CTGGTCCATG GCGAGGGCTA TGACTTCGTG CCGGGCCAGA CCCTTGCCGT GCCGGCGATG GACATGGCGC AGGAAGACTA TCGCCGCCTC GTCCGCCTGG AGAAGACGGG TGCCGCCCCG CAGCTCTCGC TCAGCATCGA CGCGAGCTTC GACGACAAGG ATCTGATGGC GGACAACGTC ATTGCCGAGA TCCCCGGCAG CGATCCCAAG GCGGGCTACG TCATGGCGGG TGCGCACTTC GACAGCTGGA TCGCGGGCGA CGGCGCATCC GACAATGGAG CGGGAAGCGT CGCCGTGATC GAGGCTGCAC GCCTGCTCTC GAAAATGGGC GTCAAGCCGA AGCGCACCAT CCGCTTTGCG CTGTGGAGCG GGGAGGAGCA GGGGCTGCTG GGCTCGAAGG CCTATATCGA GCAGCATCTC GCCACCCGGC CGGTCGACCC GGCGCTAAAG GGCATCGACA GCTATTCGGC ATGGCGCAAT GCCTATCCGA TCACGCCCAA GCCGGGCTAT TCCCAGCTCA AGGCCTATTT CAACATGGAC AACGGTTCGG GCAAGTTCCG CGGCATTTAT GCCGAGGGCA ACGTTGCCGC CGCGCCGATC CTCAGGGAAT GGCTCGCGCC CTTCAGCTCG CTCGGCGCGG ACAAGGTGGT GATGAGCAAG ACGGGCGGGA CCGACCACGT CTATCTCCAG GCAATCGGCC TGCCGGGCTA CCAGTTCATC CAGGACCCGC TCGATTACGA GAGCCGCGTG CACCACTCCA GCCTCGACAC GCTTGACCAC ATGCGCGCCG ACGACATGAG GCAGGCCTCC GTCATTCTGG CGGGAATGCT GCTCCAGGCG GCGACAAGCG AGAAGGAACT GCCGCGCTCG CCGTTGCCGA CCAAGCCGGA CGCGACCGAT CCGTTCAAGG TGCAGGACCC CAACCAGTAG
Protein sequence | MTSCLRGAAS AAAIALCTFA APAIAGPAED KLIAQLLDEG LNRSDAMEIA SELMDRIGPR LTNSENHRKA EDWAAAKFAS FGLKNIHREP FEFGLGWNLK SYSATMTSPR SLPLTVLPVA WSPPTGGTIT APVIVAPMTK VENFDAWKGK LAGRIVLVSL PGETSASADP VFERLSGEEI GKLDKYTLPR HDPEGLAMQV ARRGFARKLS EFLKAEGALA MVRMTYRDGK LVHGEGYDFV PGQTLAVPAM DMAQEDYRRL VRLEKTGAAP QLSLSIDASF DDKDLMADNV IAEIPGSDPK AGYVMAGAHF DSWIAGDGAS DNGAGSVAVI EAARLLSKMG VKPKRTIRFA LWSGEEQGLL GSKAYIEQHL ATRPVDPALK GIDSYSAWRN AYPITPKPGY SQLKAYFNMD NGSGKFRGIY AEGNVAAAPI LREWLAPFSS LGADKVVMSK TGGTDHVYLQ AIGLPGYQFI QDPLDYESRV HHSSLDTLDH MRADDMRQAS VILAGMLLQA ATSEKELPRS PLPTKPDATD PFKVQDPNQ
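The sequence fields are printed in space-separated blocks of ten characters, so the blocks need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-percentage calculation and a basic reading-frame check. The function names are hypothetical, and the stop-codon set assumes the three standard stops used by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons under translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG,
    and it ends with a stop codon. Table 11 also permits other start codons;
    only ATG is checked here because this record starts with ATG."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence as shown in the record.
prefix = clean_sequence("ATGACATCTT GCCTGCGCGG")
print(len(prefix), gc_percent(prefix))  # 20 60.0
```

Applying `gc_percent` to the full cleaned gene sequence should give a value close to the 65% reported in the table, though that figure has not been recomputed here.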