Gene Saro_2591 details


Gene Information

Locus tag: Saro_2591
Symbol:
ID: 3917005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2797271
End bp: 2799415
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 65%
IMG OID: 640445349
Product: dipeptidyl-peptidase 7
Protein accession: YP_497861
Protein GI: 87200604
COG category:
COG ID:
TIGRFAM ID:
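
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the terminal stop codon. Both are assumptions that fit the numbers in this record, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2797271
END_BP = 2799415
GENE_LENGTH_BP = 2145
PROTEIN_LENGTH_AA = 714


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

A mismatch in either check on another record would suggest a different coordinate convention, a spliced gene, or a partial CDS.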


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGTTAT GTAATAGTGT ATCTTTCCAT CTGCCTTGCC CCGTGCGAAG GGGGAGATTG 
CGTTTGAAAC CAATTGTGCA GGAAAGGCTG TCGATAATGG CTGACCTTCG TATTCCGGCA
TCGCTTGCCG CATTGGCAGG GCTCCTGCTT GCCGGACCGG CCCGCGCCGA CGAGGGCATG
TGGACTTTCG ATGCCTTTCC GTCCGCGAAG ATGCAGGCAG ATTACGGCTG GGCGCCGGAT
GGCAGGTGGC TGGACCGGGT GCAGGCGGCG GCGGTGCGGC TGACCGGTGG TTGTTCCGCC
AGTTTCGTAT CGCCGGACGG GCTGATCCTG ACCAATCATC ACTGCGTGGT CGAGTGCGCG
CAGGACAATT CGACGGACGA GAACGACTTG CTGAAGCTCG GCTTCGTGCC TGTCCGGCGC
GAGGAGGAGC TGAAATGCCC CGGTCAGCAG GCCGAAGTGG TGACCGCGAT CGGGGACGTG
ACCGCGCGCG TCAGGGCGGC GATCGGCACG GCCACAGGTG AAGCGCTGGT CAAGGCGCGC
GATGCGGAGG CGGCCCGGAT CGAGAAGGAG GGATGCAGGG ACGCCGCGAC CACCCGGTGC
GAGGTCGTCA CGCTGTTCGG CGGCGGGCAA TACAAGCTCT ATACCTACCG CAAGTATGCC
GACGTGCGGC TGGCATGGGC GCCGCAGTTC CAGGCGGCGT TCTTCGGCGG CGATCCGGAC
AACTTCAATT ATCCGCGCTA TGCGCTCGAT GCCGCGTTCC TTCGCGCCTA CGAGAACGGT
CGGCCGGTGA AGGTGAAATC CTTCCTGAAG TGGAACCCGC GCGCGCCGCA AGTGGGCGAG
GCGACGTTCG TCGTGGGCAA TCCCGGGTCG ACGCAACGGC TGTTTACCTC GGAACAGATC
GCCTTCCAGC GCGAAGTGGG CCTGCCGTTG ACCACGACGA TCCTGTCGGA GCTGCGCGGT
CGGCTGATCG GCGCAATGGA ACGCAGCCCC CAGGCGAAGC GCGAAGGCGC GGACGAACTG
TTCGGGATCG AGAACAGCCT GAAGGTCTAT GTCGGGCGGC AGAAGGCGCT GAACGACCCG
GCGTTCCTGA AGATGCTGGC CGAGGCCGAG GCGGATTTGA AAGCGAAGTC GCTGGGCAAG
CCGGGTATCG GAGATCCCTG GGCCGACACG GCAAGGGCGG TGAAAGCCTA TCGCGATCTC
TATGTGCCCT GGCGGTTCAT CGTGCCGTGG GGTTCGCTGA TGGGCTATGC GCAGACCATC
GTTCAGGGCA CGGCCGAGCG CGAGAAGCCC GATGCCGATC GCCTGCCGGG CTATACGGAA
AGCAACCTGG AGCTGACCAC CAAGACGCTG CTGGACGAAG CGCCGGTCTA TCCCTGGCTG
GAACAGGTCG AGATGGCCTG GAGCCTTTCA AAGGCGCGCG AATACCTTGG CGCGGACGAT
GCTGACACCC GGCTGCTGCT GGGCCGGGAA TCGCCTGAGG CCCTGGCCGA GCGACTTGTG
GGTGGAACCA CGCTGGCGGA CCCGGCGGTG CGCCGCGCGC TGTGGGAAGG CGGGCGCAAG
GCGGTGGAGG CATCGAGCGA TCCGATGATC GTCTATGCCC GCGCCATCGA CGCGCGCGAG
CGGGAGCTCA AGAAACTGGT CGACGAACGC TACGCCGGTC CGCTGGCCGA GGCCGGGGCG
ACGCTCGCCG ATGCGCGGTT CCTCGCTTAT GGCGACAGGA TCTACCCGGA TGCGACGTTC
ACCCTGCGCA TCAGCTATGG CAAGGTGCAG GGCTGGAAGG AGCGCGGCAT TGATGTGGCG
GCGGTGACCA CGCTGGGCGG CGCGTTCGAG CGGGCGACGG GGGCCGAACC GTTCGATCTG
GCCAGCGCCT TTGCCGCGAA CGAGGCACGG ATCGACAAGG CGGTGCCGTT CGATTTCGTC
ACGACCAACG ACATCATCGG CGGCAATTCC GGATCACCGG TAATCGACAG GTCCGGGACG
GTGATCGGTG CGGCGTTTGA CGGGAACATC CATTCCATCG GCGGCAACTA TGGCTACCAG
GGGGACGTCA ACCGGACGGT GGTCGTCAGC GCGGCGGCGG TGCAGCACGC GCTGGAAGTG
ATCTATCCGG CGCCGGCACT GGTGAAGGAA TTGCGGGGCA AGTGA
 
Protein sequence
MGLCNSVSFH LPCPVRRGRL RLKPIVQERL SIMADLRIPA SLAALAGLLL AGPARADEGM 
WTFDAFPSAK MQADYGWAPD GRWLDRVQAA AVRLTGGCSA SFVSPDGLIL TNHHCVVECA
QDNSTDENDL LKLGFVPVRR EEELKCPGQQ AEVVTAIGDV TARVRAAIGT ATGEALVKAR
DAEAARIEKE GCRDAATTRC EVVTLFGGGQ YKLYTYRKYA DVRLAWAPQF QAAFFGGDPD
NFNYPRYALD AAFLRAYENG RPVKVKSFLK WNPRAPQVGE ATFVVGNPGS TQRLFTSEQI
AFQREVGLPL TTTILSELRG RLIGAMERSP QAKREGADEL FGIENSLKVY VGRQKALNDP
AFLKMLAEAE ADLKAKSLGK PGIGDPWADT ARAVKAYRDL YVPWRFIVPW GSLMGYAQTI
VQGTAEREKP DADRLPGYTE SNLELTTKTL LDEAPVYPWL EQVEMAWSLS KAREYLGADD
ADTRLLLGRE SPEALAERLV GGTTLADPAV RRALWEGGRK AVEASSDPMI VYARAIDARE
RELKKLVDER YAGPLAEAGA TLADARFLAY GDRIYPDATF TLRISYGKVQ GWKERGIDVA
AVTTLGGAFE RATGAEPFDL ASAFAANEAR IDKAVPFDFV TTNDIIGGNS GSPVIDRSGT
VIGAAFDGNI HSIGGNYGYQ GDVNRTVVVS AAAVQHALEV IYPAPALVKE LRGK
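
The gene and protein sequences above are related by translation table 11, whose sense-codon assignments are the same as the standard genetic code (table 11 differs mainly in the start codons it allows). The following is a minimal sketch of translating a space-grouped sequence block like the one above and computing its GC fraction. The helper names are hypothetical, ambiguous bases such as N are not handled, and the record's own tool for deriving its GC content value is not known.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First six codons of the gene sequence above.
print(translate("ATGGGGTTAT GTAATAGT"))  # MGLCNS
```

The output matches the first six residues of the protein sequence listed above. Applying `translate` to the full gene sequence and dropping the trailing `*` should reproduce the whole protein under the same assumptions.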