Gene Information |
Locus tag | Saro_2591 |
Symbol | |
ID | 3917005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2797271 |
End bp | 2799415 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640445349 |
Product | dipeptidyl-peptidase 7 |
Protein accession | YP_497861 |
Protein GI | 87200604 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
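The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon. The record does not state either convention, but its values are consistent with both. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(2797271, 2799415)
    print(length)                     # 2145
    print(protein_length_aa(length))  # 714
```

Both results match the Gene Length (2145 bp) and Protein Length (714 aa) fields of this record.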
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTTAT GTAATAGTGT ATCTTTCCAT CTGCCTTGCC CCGTGCGAAG GGGGAGATTG CGTTTGAAAC CAATTGTGCA GGAAAGGCTG TCGATAATGG CTGACCTTCG TATTCCGGCA TCGCTTGCCG CATTGGCAGG GCTCCTGCTT GCCGGACCGG CCCGCGCCGA CGAGGGCATG TGGACTTTCG ATGCCTTTCC GTCCGCGAAG ATGCAGGCAG ATTACGGCTG GGCGCCGGAT GGCAGGTGGC TGGACCGGGT GCAGGCGGCG GCGGTGCGGC TGACCGGTGG TTGTTCCGCC AGTTTCGTAT CGCCGGACGG GCTGATCCTG ACCAATCATC ACTGCGTGGT CGAGTGCGCG CAGGACAATT CGACGGACGA GAACGACTTG CTGAAGCTCG GCTTCGTGCC TGTCCGGCGC GAGGAGGAGC TGAAATGCCC CGGTCAGCAG GCCGAAGTGG TGACCGCGAT CGGGGACGTG ACCGCGCGCG TCAGGGCGGC GATCGGCACG GCCACAGGTG AAGCGCTGGT CAAGGCGCGC GATGCGGAGG CGGCCCGGAT CGAGAAGGAG GGATGCAGGG ACGCCGCGAC CACCCGGTGC GAGGTCGTCA CGCTGTTCGG CGGCGGGCAA TACAAGCTCT ATACCTACCG CAAGTATGCC GACGTGCGGC TGGCATGGGC GCCGCAGTTC CAGGCGGCGT TCTTCGGCGG CGATCCGGAC AACTTCAATT ATCCGCGCTA TGCGCTCGAT GCCGCGTTCC TTCGCGCCTA CGAGAACGGT CGGCCGGTGA AGGTGAAATC CTTCCTGAAG TGGAACCCGC GCGCGCCGCA AGTGGGCGAG GCGACGTTCG TCGTGGGCAA TCCCGGGTCG ACGCAACGGC TGTTTACCTC GGAACAGATC GCCTTCCAGC GCGAAGTGGG CCTGCCGTTG ACCACGACGA TCCTGTCGGA GCTGCGCGGT CGGCTGATCG GCGCAATGGA ACGCAGCCCC CAGGCGAAGC GCGAAGGCGC GGACGAACTG TTCGGGATCG AGAACAGCCT GAAGGTCTAT GTCGGGCGGC AGAAGGCGCT GAACGACCCG GCGTTCCTGA AGATGCTGGC CGAGGCCGAG GCGGATTTGA AAGCGAAGTC GCTGGGCAAG CCGGGTATCG GAGATCCCTG GGCCGACACG GCAAGGGCGG TGAAAGCCTA TCGCGATCTC TATGTGCCCT GGCGGTTCAT CGTGCCGTGG GGTTCGCTGA TGGGCTATGC GCAGACCATC GTTCAGGGCA CGGCCGAGCG CGAGAAGCCC GATGCCGATC GCCTGCCGGG CTATACGGAA AGCAACCTGG AGCTGACCAC CAAGACGCTG CTGGACGAAG CGCCGGTCTA TCCCTGGCTG GAACAGGTCG AGATGGCCTG GAGCCTTTCA AAGGCGCGCG AATACCTTGG CGCGGACGAT GCTGACACCC GGCTGCTGCT GGGCCGGGAA TCGCCTGAGG CCCTGGCCGA GCGACTTGTG GGTGGAACCA CGCTGGCGGA CCCGGCGGTG CGCCGCGCGC TGTGGGAAGG CGGGCGCAAG GCGGTGGAGG CATCGAGCGA TCCGATGATC GTCTATGCCC GCGCCATCGA CGCGCGCGAG CGGGAGCTCA AGAAACTGGT CGACGAACGC TACGCCGGTC CGCTGGCCGA GGCCGGGGCG ACGCTCGCCG ATGCGCGGTT CCTCGCTTAT GGCGACAGGA TCTACCCGGA TGCGACGTTC ACCCTGCGCA TCAGCTATGG CAAGGTGCAG GGCTGGAAGG AGCGCGGCAT TGATGTGGCG 
GCGGTGACCA CGCTGGGCGG CGCGTTCGAG CGGGCGACGG GGGCCGAACC GTTCGATCTG GCCAGCGCCT TTGCCGCGAA CGAGGCACGG ATCGACAAGG CGGTGCCGTT CGATTTCGTC ACGACCAACG ACATCATCGG CGGCAATTCC GGATCACCGG TAATCGACAG GTCCGGGACG GTGATCGGTG CGGCGTTTGA CGGGAACATC CATTCCATCG GCGGCAACTA TGGCTACCAG GGGGACGTCA ACCGGACGGT GGTCGTCAGC GCGGCGGCGG TGCAGCACGC GCTGGAAGTG ATCTATCCGG CGCCGGCACT GGTGAAGGAA TTGCGGGGCA AGTGA
|
Protein sequence | MGLCNSVSFH LPCPVRRGRL RLKPIVQERL SIMADLRIPA SLAALAGLLL AGPARADEGM WTFDAFPSAK MQADYGWAPD GRWLDRVQAA AVRLTGGCSA SFVSPDGLIL TNHHCVVECA QDNSTDENDL LKLGFVPVRR EEELKCPGQQ AEVVTAIGDV TARVRAAIGT ATGEALVKAR DAEAARIEKE GCRDAATTRC EVVTLFGGGQ YKLYTYRKYA DVRLAWAPQF QAAFFGGDPD NFNYPRYALD AAFLRAYENG RPVKVKSFLK WNPRAPQVGE ATFVVGNPGS TQRLFTSEQI AFQREVGLPL TTTILSELRG RLIGAMERSP QAKREGADEL FGIENSLKVY VGRQKALNDP AFLKMLAEAE ADLKAKSLGK PGIGDPWADT ARAVKAYRDL YVPWRFIVPW GSLMGYAQTI VQGTAEREKP DADRLPGYTE SNLELTTKTL LDEAPVYPWL EQVEMAWSLS KAREYLGADD ADTRLLLGRE SPEALAERLV GGTTLADPAV RRALWEGGRK AVEASSDPMI VYARAIDARE RELKKLVDER YAGPLAEAGA TLADARFLAY GDRIYPDATF TLRISYGKVQ GWKERGIDVA AVTTLGGAFE RATGAEPFDL ASAFAANEAR IDKAVPFDFV TTNDIIGGNS GSPVIDRSGT VIGAAFDGNI HSIGGNYGYQ GDVNRTVVVS AAAVQHALEV IYPAPALVKE LRGK
|
| |
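The gene and protein sequences above can be checked against each other with a short translation routine. This is a minimal sketch, not the database's own method. It assumes the gene sequence is given in the coding (sense) orientation even though the gene lies on the minus strand, which is consistent with it starting with ATG. It also relies on translation table 11 assigning the same amino acid to each codon as the standard code (the tables differ in permitted start codons). The helper names `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks; uppercase."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a sense-strand CDS, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate_cds("ATGGGGTTAT GTAATAGTGT ATCTTTCCAT"))  # MGLCNSVSFH
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.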