Gene Saro_1409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1409 
Symbol 
ID3916073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1451278 
End bp1453542 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content64% 
IMG OID640444152 
Productdipeptidyl-peptidase IV 
Protein accessionYP_496687 
Protein GI87199430 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCATT TCCGCCTGCT TGCTGCCTGC GCGCTGTCTT CACTTGCCAT TGCACCACTG 
GCCCACGCCC AACAGGGACA ATCGATGACC GCCACTGCCG CGCCCGCCGA AGCCGGAGCC
CTGACTTTCG AACGCGTGTT CGCCAGCCCG AGCCTCAACG GTCTGGCCCC GCGCGCGGTC
AAGCTGTCGC CCGATGGCCG TTACCTGACG CTGCTGCGCA ACCGCGCCGA CGACCGCGAG
CGTTATGACC TGTGGGGCTT TGACCGCCAG ACCGGCGAGT GGAAGATGCT GGTGGATTCG
CTCAAGCTCT CGTCGGGCCG CCAGTTGACC GAAGCCGAGA AGATGCAGCG TGAGCGCCAG
CGCATCGGCG ATCTCAAGGG CATCGTGTCC TACGAGTGGT CGGCGGACAG CAAGTCGGTG
CTGGTGCCGG TGGACGGAGA CCTGCTGCTG GCCGGTCTCG ACGGTTCGGT CCGCAAGGTG
GAAGGCACCA AGGGCGGCGA GCTTACGCCC AAGCTTGGGC CCAAGGGCGA ACACATCGCA
TTCGTGCGCG ACAGGCGGCT GTGGGCCGGG CCGGTGACCG GCACCGCCGC CGTGGCGATC
ACGCCCGAAG AGGCCAATGC GGACGTTCAC TGGGGTGAGG CCGAGTTCGT CGCGCAGGAG
GAAATGAACC GCTTCAACGG CTTCTGGTGG TCGCCCGACG AAAGCCGCAT CGCGGTCGAG
CGCTTCGACG AGAGCATGGT GGGCGTGGTC ACCCGCGCGG CCATCGGCGC GGAAGGGACG
AAGACCTTCG ACCAGCGCTA TCCGGCGGCG GGCACGCCCA ATGCGGAAGT CTCGCTTTAC
GTGATTGGGC CGGACGGTTC CAACCGGGTG CAGGTCGATC TCGGCGCCAA CAAGGACATC
TATCTCGCCC GCGTGGACTG GGCACCTGAC GGCAAGACGC TCTACGTCCA GCGCATGAAC
CGCGAGCAGA CCGTGCTCGA CATGCTCAAG GTCGATCCCG TGACGGGGAA GTCGAGCGTG
CTGTTCAGCG AGAAGGCCGC GGCGAAGCAC TGGATCGACC TTTCGGACAG CTATCGGTTT
CTGGCCGACG GCAGCCTGAT CTGGTGGTCG CAGCGCGACG GGTTCGGGCA CCTCTACCGC
TTCAAGAACG GGAAGTGGAG CCAGCTTACC AAGGGTGAGT GGGTCGTGAC CGGGCTTGTC
GGCGTCGACG AGAAGGGCGG CAAGCTCTAC CTTGCCGGGA CCAAGGACGA CGTACTGGCG
CCGCAGGTCT ATGCGATGGA CCTCAAGGCG CCGGGCAAGC TCACGCGGCT GACCGAGCTT
GGCTGGGTCA ACGGGGCTAG CATGGACAAG AGCGGGCAGA CGCTGATGAT CACGCGTTCG
TCGGATGCGC AGCCGGCCCA GTCCTACATC GCCGACACTG CCGGCAAGAA CCTCGCCTGG
ATCGAGGAGA ACAAGGTCGC GGGCTCGCAC CCCTATGCGC CCTATCTGGC CAGCCATCGC
CCGGCGCAGT TCGGCACCAT CCCGGCTGCC GATGGCACAC CGCTGCACTA CATGATGATC
ACTCCGCCGC TGGAGCCGGG CAAGAAGTAT CCGGTGTTCA CCTACCATTA CGGCGGGCCG
ACCGCGCAGG TGGTGACCAA GGGCTTCCAG GGGGCGCTGG CGCAGGCAAT CGTCGACAAA
GGCTATATCT ATTTCGCCAT CGACAATCGC GGCTCTGAAA ACCGCGGCGT CAAGTTCGCT
TCCGCGTTGC ATCACGCGAT GGGATCGGTC GAGGTCGAGG ATCAGCTCGC GGGGGCGAAC
TGGCTCAAGA AGCAGGCGTT CGTCGATGCC GACAAGATCA GCACGTTCGG CTGGTCCTAT
GGCGGATACA TGTCGATCAA GATGCTCGAG GCAAATCCGG GGGCCTATGC AGCTGGCATC
GCCGTCGCGC CCGTGACCAA GTGGCAGATG TACGACACCA CTTATACCGA GCGCTACCTT
GGCGACCCCG GCAAGCTGCC GGAGGTCTAC GAGAAGGCGA ACGCCCTGGC CGATACGGGC
AAGATCAGCG ATCCGCTGCT GATCATCCAC GGCATGGCCG ACGACAACGT GGTGTTCGAG
AACGCCAGCG CCATCATCGC CAAAATGCAG GCCGAGGCGG TGCCGTTCGA GATGATGCTT
TATCCCGGCT ACACCCACCG CATCAGCGGA CCGAAGGTGA GCCAGCATTT GTACGAGACG
ATTTTCCGCT TTCTCGACCG TAATGGAGCG GGGAGCGGAA AGTAG
 
Protein sequence
MRHFRLLAAC ALSSLAIAPL AHAQQGQSMT ATAAPAEAGA LTFERVFASP SLNGLAPRAV 
KLSPDGRYLT LLRNRADDRE RYDLWGFDRQ TGEWKMLVDS LKLSSGRQLT EAEKMQRERQ
RIGDLKGIVS YEWSADSKSV LVPVDGDLLL AGLDGSVRKV EGTKGGELTP KLGPKGEHIA
FVRDRRLWAG PVTGTAAVAI TPEEANADVH WGEAEFVAQE EMNRFNGFWW SPDESRIAVE
RFDESMVGVV TRAAIGAEGT KTFDQRYPAA GTPNAEVSLY VIGPDGSNRV QVDLGANKDI
YLARVDWAPD GKTLYVQRMN REQTVLDMLK VDPVTGKSSV LFSEKAAAKH WIDLSDSYRF
LADGSLIWWS QRDGFGHLYR FKNGKWSQLT KGEWVVTGLV GVDEKGGKLY LAGTKDDVLA
PQVYAMDLKA PGKLTRLTEL GWVNGASMDK SGQTLMITRS SDAQPAQSYI ADTAGKNLAW
IEENKVAGSH PYAPYLASHR PAQFGTIPAA DGTPLHYMMI TPPLEPGKKY PVFTYHYGGP
TAQVVTKGFQ GALAQAIVDK GYIYFAIDNR GSENRGVKFA SALHHAMGSV EVEDQLAGAN
WLKKQAFVDA DKISTFGWSY GGYMSIKMLE ANPGAYAAGI AVAPVTKWQM YDTTYTERYL
GDPGKLPEVY EKANALADTG KISDPLLIIH GMADDNVVFE NASAIIAKMQ AEAVPFEMML
YPGYTHRISG PKVSQHLYET IFRFLDRNGA GSGK