Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1409 |
Symbol | |
ID | 3916073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1451278 |
End bp | 1453542 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444152 |
Product | dipeptidyl-peptidase IV |
Protein accession | YP_496687 |
Protein GI | 87199430 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCATT TCCGCCTGCT TGCTGCCTGC GCGCTGTCTT CACTTGCCAT TGCACCACTG GCCCACGCCC AACAGGGACA ATCGATGACC GCCACTGCCG CGCCCGCCGA AGCCGGAGCC CTGACTTTCG AACGCGTGTT CGCCAGCCCG AGCCTCAACG GTCTGGCCCC GCGCGCGGTC AAGCTGTCGC CCGATGGCCG TTACCTGACG CTGCTGCGCA ACCGCGCCGA CGACCGCGAG CGTTATGACC TGTGGGGCTT TGACCGCCAG ACCGGCGAGT GGAAGATGCT GGTGGATTCG CTCAAGCTCT CGTCGGGCCG CCAGTTGACC GAAGCCGAGA AGATGCAGCG TGAGCGCCAG CGCATCGGCG ATCTCAAGGG CATCGTGTCC TACGAGTGGT CGGCGGACAG CAAGTCGGTG CTGGTGCCGG TGGACGGAGA CCTGCTGCTG GCCGGTCTCG ACGGTTCGGT CCGCAAGGTG GAAGGCACCA AGGGCGGCGA GCTTACGCCC AAGCTTGGGC CCAAGGGCGA ACACATCGCA TTCGTGCGCG ACAGGCGGCT GTGGGCCGGG CCGGTGACCG GCACCGCCGC CGTGGCGATC ACGCCCGAAG AGGCCAATGC GGACGTTCAC TGGGGTGAGG CCGAGTTCGT CGCGCAGGAG GAAATGAACC GCTTCAACGG CTTCTGGTGG TCGCCCGACG AAAGCCGCAT CGCGGTCGAG CGCTTCGACG AGAGCATGGT GGGCGTGGTC ACCCGCGCGG CCATCGGCGC GGAAGGGACG AAGACCTTCG ACCAGCGCTA TCCGGCGGCG GGCACGCCCA ATGCGGAAGT CTCGCTTTAC GTGATTGGGC CGGACGGTTC CAACCGGGTG CAGGTCGATC TCGGCGCCAA CAAGGACATC TATCTCGCCC GCGTGGACTG GGCACCTGAC GGCAAGACGC TCTACGTCCA GCGCATGAAC CGCGAGCAGA CCGTGCTCGA CATGCTCAAG GTCGATCCCG TGACGGGGAA GTCGAGCGTG CTGTTCAGCG AGAAGGCCGC GGCGAAGCAC TGGATCGACC TTTCGGACAG CTATCGGTTT CTGGCCGACG GCAGCCTGAT CTGGTGGTCG CAGCGCGACG GGTTCGGGCA CCTCTACCGC TTCAAGAACG GGAAGTGGAG CCAGCTTACC AAGGGTGAGT GGGTCGTGAC CGGGCTTGTC GGCGTCGACG AGAAGGGCGG CAAGCTCTAC CTTGCCGGGA CCAAGGACGA CGTACTGGCG CCGCAGGTCT ATGCGATGGA CCTCAAGGCG CCGGGCAAGC TCACGCGGCT GACCGAGCTT GGCTGGGTCA ACGGGGCTAG CATGGACAAG AGCGGGCAGA CGCTGATGAT CACGCGTTCG TCGGATGCGC AGCCGGCCCA GTCCTACATC GCCGACACTG CCGGCAAGAA CCTCGCCTGG ATCGAGGAGA ACAAGGTCGC GGGCTCGCAC CCCTATGCGC CCTATCTGGC CAGCCATCGC CCGGCGCAGT TCGGCACCAT CCCGGCTGCC GATGGCACAC CGCTGCACTA CATGATGATC ACTCCGCCGC TGGAGCCGGG CAAGAAGTAT CCGGTGTTCA CCTACCATTA CGGCGGGCCG ACCGCGCAGG TGGTGACCAA GGGCTTCCAG GGGGCGCTGG CGCAGGCAAT CGTCGACAAA GGCTATATCT ATTTCGCCAT CGACAATCGC GGCTCTGAAA ACCGCGGCGT CAAGTTCGCT TCCGCGTTGC ATCACGCGAT GGGATCGGTC GAGGTCGAGG ATCAGCTCGC GGGGGCGAAC TGGCTCAAGA AGCAGGCGTT CGTCGATGCC GACAAGATCA GCACGTTCGG CTGGTCCTAT GGCGGATACA TGTCGATCAA GATGCTCGAG GCAAATCCGG GGGCCTATGC AGCTGGCATC GCCGTCGCGC CCGTGACCAA GTGGCAGATG TACGACACCA CTTATACCGA GCGCTACCTT GGCGACCCCG GCAAGCTGCC GGAGGTCTAC GAGAAGGCGA ACGCCCTGGC CGATACGGGC AAGATCAGCG ATCCGCTGCT GATCATCCAC GGCATGGCCG ACGACAACGT GGTGTTCGAG AACGCCAGCG CCATCATCGC CAAAATGCAG GCCGAGGCGG TGCCGTTCGA GATGATGCTT TATCCCGGCT ACACCCACCG CATCAGCGGA CCGAAGGTGA GCCAGCATTT GTACGAGACG ATTTTCCGCT TTCTCGACCG TAATGGAGCG GGGAGCGGAA AGTAG
|
Protein sequence | MRHFRLLAAC ALSSLAIAPL AHAQQGQSMT ATAAPAEAGA LTFERVFASP SLNGLAPRAV KLSPDGRYLT LLRNRADDRE RYDLWGFDRQ TGEWKMLVDS LKLSSGRQLT EAEKMQRERQ RIGDLKGIVS YEWSADSKSV LVPVDGDLLL AGLDGSVRKV EGTKGGELTP KLGPKGEHIA FVRDRRLWAG PVTGTAAVAI TPEEANADVH WGEAEFVAQE EMNRFNGFWW SPDESRIAVE RFDESMVGVV TRAAIGAEGT KTFDQRYPAA GTPNAEVSLY VIGPDGSNRV QVDLGANKDI YLARVDWAPD GKTLYVQRMN REQTVLDMLK VDPVTGKSSV LFSEKAAAKH WIDLSDSYRF LADGSLIWWS QRDGFGHLYR FKNGKWSQLT KGEWVVTGLV GVDEKGGKLY LAGTKDDVLA PQVYAMDLKA PGKLTRLTEL GWVNGASMDK SGQTLMITRS SDAQPAQSYI ADTAGKNLAW IEENKVAGSH PYAPYLASHR PAQFGTIPAA DGTPLHYMMI TPPLEPGKKY PVFTYHYGGP TAQVVTKGFQ GALAQAIVDK GYIYFAIDNR GSENRGVKFA SALHHAMGSV EVEDQLAGAN WLKKQAFVDA DKISTFGWSY GGYMSIKMLE ANPGAYAAGI AVAPVTKWQM YDTTYTERYL GDPGKLPEVY EKANALADTG KISDPLLIIH GMADDNVVFE NASAIIAKMQ AEAVPFEMML YPGYTHRISG PKVSQHLYET IFRFLDRNGA GSGK
|
| |