Gene SO_3990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_3990 
Symbol 
ID1171621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp4127030 
End bp4129321 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content49% 
IMG OID637345741 
Productdipeptidyl peptidase IV 
Protein accessionNP_719520 
Protein GI24375477 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAA ATGGGTTAGC TTCCTCCCTG CGCGCCGTGA AGTTGGGCGC TTCATCCCTA 
TTGATTGCGA GTCAACTGGC CATGATTTCT ACCCTTACCA CAGCTTCTGC CTATGCGCTT
GAGGGCGGTA TGACGCCACT CACTATCGAG CGGATGAATG CTTCTCCCGC CTTGGCGGGC
ACCAGTCCGC GCGGATTAAA GCTATCCCCC GATGGCCAAA AGGTGACCTA TCTAGCGGGG
CGTAAAGATA ACCAAAACTT TTACGATCTT TGGCAAATGG ATGTTAAAAC AGGAAAATCT
AGCCTGTTGC TGAATGCCGA TAAACTGGCG AGTAATGAAC TGTCCGATGA AGAAAAGGCC
CGCCGCGAAC GCCAACGGAT TTATGGCGAA GGCATTATGG AATATTTCTG GGCCGACGAT
AGCAAAGCGC TGTTGATCCC CGCTGCGGGC AATTTGTATT ACTTCTCCTT GGCGGATAAC
AGCGTCAGCC TATTGCCGAT TGGTGAAGGT TTTGCCACTG ATGCCCGCCT ATCGCCCAAG
GGTAACTTTG TATCCTTCGT TCGTGATCAA AATCTGTATG TACTTAATCT TGCAACTAAA
AAGCTTGAGG CCATGACCAC CGATGGTGGC GGCGTGATTA AAAATGCCAT GGCAGAATTT
GTGGCGCAGG AAGAAATGGA TCGCATGACA GGTTACTGGT GGGCGCCCGA TGAGTCGGCC
ATTGCCTTTA CTCGTATTGA TGAATCCGCT GTTGAGTTAG TCACCCGCAA TGAAATCTAT
GCCGATGGCA TTAAACTCAC CGAGCAGCGT TATCCGGCGG CAGGTAAAAA CAATGTCGAG
ATCCAACTCG GTGTAGTAAC GCTGAAAAAT AAAGCCATCG ATTGGGTAAC CTTAAGCGAT
GATAAAAATA AGGATATTTA CCTGCCTCGC GTCGATTGGT TGCCGGATAG CAAGCATTTA
TCCTTTCAAT GGCAGAGCCG CGATCAACAA AAGTTAGACC TGCAACTGGT TGCCTTAGAT
TCGCTGACCA AGCCGAAAAC CTTAGTCAAA GAGCGCAGCG ATGCTTGGGT GAATCTTAAC
AACGACTTGC ACTTCTTAAA GCAACAATCG GCTTTTATTT GGGCCTCGGA GCGTGATGGC
TTTAATCATC TCTATTTGTT TGACCTAAAA GGTAAGCTGA AGACGCAACT GACCAAGGGC
AATTGGGCTG TCGATGAGCT GGAATTTATC GATGAAACCG CAGGTTGGGT GTATTTCACT
GGCCGCAAAG ATACCCCAAT CGAAAAACAC CTTTACCGTG TGCCGTTAGC GGGCGGAAAT
ATTGAGCGCG TGAGCAGTGA GGCGGGGATG CACGATCCGG TATTTGCCGA TAATCAAAGC
GTGTATCTCG ATTATTTCAA TAGCTTATCT CAACCGCCGC AGGTCAGTTT GCATGGCGAT
AAGGGCCAGC ATTTGGCCTG GGTTGAGCAG AACCAAGTCA AAGCGGGTCA TCCTCTGTAT
GACTATGCTG GACTCTGGCA ACTACCTGAG TTTAAAGAGC TTAAAGCCGA AGATGGCCAA
ATCTTACAAA CGCGTTTATT TAAACCCGTG CCCTTCGATG CGGGGAAAAA GTACCCCGCG
GTCGTGCGGG TTTATGGCGG CCCACATGCG CAGTTGGTGA CCAATAGCTG GAGTGAGCAA
GATTACTTTA CCCAATACTT AGTTCAGCAA GGTTATGTGG TATTTCAATT GGATAACCGC
GGCAGTGCCC ACAGGGGTAC CAAGTTTGAG CAAGTGATTT ACCGTCATTT GGGCGAAGCG
GAAGTCAATG ATCAAAAGGT TGGGGTGGAT TATCTGCGCA GCTTGCCCTT TGTCGATAGC
AATAATGTGG CGATTTATGG CCACAGCTAC GGCGGTTATA TGGCTTTGAT GAGTCTATTT
AAGGCGCCGG ATTACTTTAA GGCGGCGATT TCGGGTGCAC CTGTCACCGA TTGGCGTTTG
TATGACACCC ATTATACCGA GCGTTATTTA GCCCATCCCG CAAGCAATGA GCAAGGTTAT
GAAGCCAGTA GCCTGTTCCC CTATGTGAAA AACTACCAGT CGGGGCTGTT GATGTACCAC
GGCATGGCCG ACGATAACGT GTTGTTTGAA AACAGTACTC GCGTCTATAA GGCGTTGCAG
GACGAAGGCA AGTTGTTTCA AATGATTGAT TATCCAGGTT CTAAGCATTC AATGCGTGGC
GAAAAGGTGC GTAACCATTT ATATCGCTCG CTGGCGGATT TCCTCGATAG ACAGCTTAAA
AACGGCAAAT AA
 
Protein sequence
MTKNGLASSL RAVKLGASSL LIASQLAMIS TLTTASAYAL EGGMTPLTIE RMNASPALAG 
TSPRGLKLSP DGQKVTYLAG RKDNQNFYDL WQMDVKTGKS SLLLNADKLA SNELSDEEKA
RRERQRIYGE GIMEYFWADD SKALLIPAAG NLYYFSLADN SVSLLPIGEG FATDARLSPK
GNFVSFVRDQ NLYVLNLATK KLEAMTTDGG GVIKNAMAEF VAQEEMDRMT GYWWAPDESA
IAFTRIDESA VELVTRNEIY ADGIKLTEQR YPAAGKNNVE IQLGVVTLKN KAIDWVTLSD
DKNKDIYLPR VDWLPDSKHL SFQWQSRDQQ KLDLQLVALD SLTKPKTLVK ERSDAWVNLN
NDLHFLKQQS AFIWASERDG FNHLYLFDLK GKLKTQLTKG NWAVDELEFI DETAGWVYFT
GRKDTPIEKH LYRVPLAGGN IERVSSEAGM HDPVFADNQS VYLDYFNSLS QPPQVSLHGD
KGQHLAWVEQ NQVKAGHPLY DYAGLWQLPE FKELKAEDGQ ILQTRLFKPV PFDAGKKYPA
VVRVYGGPHA QLVTNSWSEQ DYFTQYLVQQ GYVVFQLDNR GSAHRGTKFE QVIYRHLGEA
EVNDQKVGVD YLRSLPFVDS NNVAIYGHSY GGYMALMSLF KAPDYFKAAI SGAPVTDWRL
YDTHYTERYL AHPASNEQGY EASSLFPYVK NYQSGLLMYH GMADDNVLFE NSTRVYKALQ
DEGKLFQMID YPGSKHSMRG EKVRNHLYRS LADFLDRQLK NGK