Gene Shewana3_0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_0649 
Symbol 
ID4476859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp766577 
End bp768868 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content51% 
IMG OID639725184 
Productdipeptidyl-peptidase IV 
Protein accessionYP_868293 
Protein GI117919101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID[TIGR01168] Gram-positive signal peptide, YSIRK family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000821828 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00018889 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTAAAA ATGGGTTAGC TTCTTCCCTG CGCTCGGTGA CATTGGGCGC CTCATCTCTA 
CTGATAGCGA GTCAACTGGC CATGGTTTCT AGCGCGATAT TTACCCCCGC CTATGCGCTT
GAGGGCGGCA CAACGCCGCT GACTATCGAG CGCATGAATG CGTCGCCTGC CTTGGCGGGC
ACGAGTCCCC GAGGATTAAA ATTATCCCCC GATGGTCAAA GGGTGACCTA TCTTGCGGGA
CGTAAAGACA ACCAAAATTT TTACGATCTC TGGCAGATGG ACGTGAAGAC GGGCGAATCT
AGCCTGTTGC TAAACGCCGA TAAGCTGGCG AGCAATGAGT TATCCGATGA AGAAAAGGCC
CGCCGCGAGC GCCAACGTAT TTATGGCGAA GGCATTATGG AATATTTCTG GGCCGACGAT
AGCAAGGCGC TGTTGATCCC TGCCTCGGGT AATCTGTATT ACTTCTCCTT AGCGGATAAC
AGCGTTAGCC AACTGCCGAT AGGCGAGGGC TTTGCCACCG ATGCTCGCCT ATCGCCCAAG
GGGAATTTTG TGTCCTTCGT GCGGGACCAA AATCTGTATG TGCTCAATTT AGCGACTAAA
AAGCTTGAGG CCATGACTAC AGACGGTGGC GGCGCGATTA AAAATGCCAT GGCGGAATTT
GTTGCCCAGG AAGAAATGGA TCGTATGACG GGTTACTGGT GGGCGCCCGA TGAATCGGCC
ATCGCCTTTA CCCGTATCGA TGAGTCGGGC GTTGAGTTGG CCACCCGCAA TGAAATCTAC
GCCGATGGCA TTAAGCTTAC CGAGCAGCGT TATCCGGCTG CGGGTAAAAA CAATGTCGAG
ATCCAACTGG GTGTTGTGAC CCTTAAGAAT AAAGCCATCA ATTGGGTCAC CTTAAGCGAT
GATAAAAATA AAGATATTTA CTTGCCTCGC GTCGATTGGT TGCCGGATAG CAAGCACTTA
TCCTTCCAGT GGCAGAGCCG CGATCAACAA AAATTAGACC TGCAACTGGT GGCTCTCGAT
GCGCTGACCA AGCCGAAAAC CTTAGTCAAA GAGCGTAGCG ATGCTTGGGT GAATCTTAAC
AATGACCTGC ACTTCTTAAA GCAACAATCG GCGTTTATCT GGGCCTCGGA GCGTGACGGC
TTTAATCATC TGTATTTGTT TGACCTAAAA GGTAAGCTGA AGACGCAATT GACCAAGGGC
AATTGGGCGG TCGATGAGCT GGAATATATT GATGAAACCG CAGGTTGGGT GTATTTCACT
GGCCGCAAAG ATACCCCTAT CGAAAAACAC CTCTACCGTG TGCCGTTAGC GGGTGGCAAG
GTTGAGCGCG TGAGCAGCGA GGCGGGGATG CACAATCCGG TATTTGCCGA TAATCAAAGC
GTGTATCTAG ATTATTTCAA TAGCTTATCT CAGCCACCAC AGATCAGTTT GCACGGCGAT
AAGGGCCAAC ATCTGGCGTG GGTAGAGCAG AACCAAGTCA AAGCGGGCCA TCCTCTGTAT
GACTATGCTG GCCTTTGGCA ATTACCAGAG TTTAAAGAGC TTAAGGCGGA AGACGGACAA
GTCCTGCAAA CGCGCCTCTT TAAACCCATG CCCTTCGATG CGAGCAAAAA GTATCCGGTA
GTCGTACGTG TTTATGGCGG CCCGCATGCG CAATTGGTTA CCAATAGCTG GAGTGAGCAG
GATTACTTTA CCCAATACTT AGTGCAACAA GGTTATGTGG TATTTCAACT GGATAACCGT
GGCAGCGCCC ACAGAGGCAC CCAGTTCGAG CAAGTGATTT ACCGTCACTT GGGTGAAGCG
GAGGTGAACG ACCAAAAGGT TGGCGTCGAT TATCTGCGCA GCTTGCCCTT TGTCGATAGC
AACAACGTGG CGATTTATGG CCACAGCTAC GGCGGTTATA TGGCCTTGAT GAGCCTGTTT
AAGGCGCCGG ATTACTTTAA GGCGGCGATT TCGGGCGCGC CCGTCACAGA TTGGCGTTTG
TACGACACTC ACTACACCGA GCGTTATCTC GCCCATCCGG CGAATAATGA ACAAGGTTAC
GAAGCCAGTA GCGTGTTCCC CTATGTGAAA AACTACCAGT CGGGGCTGTT GATGTACCAC
GGCATGGCCG ACGATAACGT GTTGTTTGAA AACAGCACTC GCGTGTACAA GGCGTTGCAG
GACGAAGGCA AACTGTTCCA GATGATTGAT TATCCTGGCT CTAAGCATTC GATGCGCGGT
GAAAAAGTGC GTAATCACTT ATATCGCTCG CTGGCGGATT TCCTCGATAG ACAGCTTAAA
AGCGGCGAAT AA
 
Protein sequence
MTKNGLASSL RSVTLGASSL LIASQLAMVS SAIFTPAYAL EGGTTPLTIE RMNASPALAG 
TSPRGLKLSP DGQRVTYLAG RKDNQNFYDL WQMDVKTGES SLLLNADKLA SNELSDEEKA
RRERQRIYGE GIMEYFWADD SKALLIPASG NLYYFSLADN SVSQLPIGEG FATDARLSPK
GNFVSFVRDQ NLYVLNLATK KLEAMTTDGG GAIKNAMAEF VAQEEMDRMT GYWWAPDESA
IAFTRIDESG VELATRNEIY ADGIKLTEQR YPAAGKNNVE IQLGVVTLKN KAINWVTLSD
DKNKDIYLPR VDWLPDSKHL SFQWQSRDQQ KLDLQLVALD ALTKPKTLVK ERSDAWVNLN
NDLHFLKQQS AFIWASERDG FNHLYLFDLK GKLKTQLTKG NWAVDELEYI DETAGWVYFT
GRKDTPIEKH LYRVPLAGGK VERVSSEAGM HNPVFADNQS VYLDYFNSLS QPPQISLHGD
KGQHLAWVEQ NQVKAGHPLY DYAGLWQLPE FKELKAEDGQ VLQTRLFKPM PFDASKKYPV
VVRVYGGPHA QLVTNSWSEQ DYFTQYLVQQ GYVVFQLDNR GSAHRGTQFE QVIYRHLGEA
EVNDQKVGVD YLRSLPFVDS NNVAIYGHSY GGYMALMSLF KAPDYFKAAI SGAPVTDWRL
YDTHYTERYL AHPANNEQGY EASSVFPYVK NYQSGLLMYH GMADDNVLFE NSTRVYKALQ
DEGKLFQMID YPGSKHSMRG EKVRNHLYRS LADFLDRQLK SGE