Gene Sbal195_3843 details

Gene Information

Locus tag: Sbal195_3843
Symbol:
ID: 5755658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 4525116
End bp: 4527431
Gene Length: 2316 bp
Protein Length: 771 aa
Translation table: 11
GC content: 48%
IMG OID: 641290185
Product: peptidase S9B dipeptidylpeptidase IV subunit
Protein accession: YP_001556263
Protein GI: 160876947
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
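
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4525116
end_bp = 4527431
gene_length_bp = 2316
protein_length_aa = 771

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one stop codon
# that is counted in the gene length but not in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: the span is 2316 bp, which is 772 codons, or 771 residues once the stop codon is excluded.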


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.036555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000141613
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGATTAAAA ATAGGTTAAC TTGGTTCCCT GCGTGCTCAC TTAAGGTGCG CGCCGTCCCC 
CTACTGATAG CGAGCCAAAT GGCCATGATT TCGACAGCTA TGATTTCAAC AACCATCCCA
GCTTTGGCAA TGGAAGGCGG AAAACTGCCA TTGACGATTG AGCGAATGAA TGCCTCGCCC
GCCTTAGCAG GCTCCAGTCC CCGTGGTTTA AAATTGTCAC CCGATGGGCT GCGCGTCACC
TATCTTGCTG GCCGTAAAGA CAATCAAAGT TTTTATGATC TCTGGCAGAT GGATGTCAAA
AGCGGTGAGT CGAGTTTGCT ATTAAACGCT GATAAACTCG CGACCAATGA ATTATCCGAT
GAAGAAAAAG CCCGCCGCGA GCGCCAACGT ATTTATGGTG AAGGCATTAT GGAGTACTTC
TGGGCCGATG ATAGCCAAGC GCTATTAATT CCGGCCTCTG GCAATCTGTA TTACTTTTCC
CTCGTAGACA ATAGCGTGAC TCAATTGCCG ATTGGCGAAG GGTTTGCTAC AGATGCACGC
TTATCGCCCA AGGGACATTT TGTGTCTTTC GTGCGGGATC AAAATTTGTA TGTGTTAGAT
CTGGCGACGA AAAAACTCCA AGTTATGACG ACCGATGGCG GCGGGGTGAT TAAAAATGCC
ATGGCCGAGT TTGTCGCTCA AGAAGAAATG GATCGCATGA CTGGCTATTG GTGGGCGCCC
GATGAGTCGG CTATCGCCTT CATTCGTATC GACGAGTCCG CGGTCGAGCA AGTGACGCGC
AATGAGATCT ATGCCGATGG CATTAAACTC ACCGAGCAGC GTTACCCAGC AGCAGGTAAA
AACAACGTCG ACATCGCGTT AGGTGTGGTC ACGCTAAAAG ATAAGGCCAT CAATTGGGTG
AGCCTGCGCG AAGAGAATAG CAAAGAAAAG AGCAAAGACA TTTACCTGCC GCGTGTCGAT
TGGTTGCCCG ATAGCAAACA CTTGTCGTTC CAGTGGCAGA GCCGCGATCA ACATCAGCTC
GATTTGCAGT TAGTGGCGTT AGATGCACTG ACTAAGCCAA AAACCTTAGT GAAAGAACGC
AGTGATGCTT GGGTAAACCT CAATAACGAT CTGCATTTCT TAAAACAGCA GTCTGCCTTT
ATTTGGGCGT CTGAGCGTGA CGGCTTTAAT CATCTGTATC TTTTTGACTT AAAAGGCAAA
CTCAAAACGC AATTGACTAA GGGCGAGTGG GCTGTCGATG AGTTGGAATA CATAGATGAA
ACCGCAGGCT GGGTGTATTT CACCGGCAGC AAAGACACGC CAATCGAGAA ACAGCTTTAT
CGCGTACCGT TAGCGGGTGG CAAGGTTGAG CGCGTGAGCA AGCAAGCGGG GATGCACAAT
CCTGTTTTCG CCGATAATCA GAGTGTATAT CTGGATTATT TCAATAGCTT ATCTCAGCCA
CCGCAAATCA GTTTACACGG TGACAAGGGC CAGCAACTGG CTTGGGTCGA GCAAAATGCG
GTTAAGCAAG GTCATCCTTT ATATGATTAT GCAGGGCTGT GGCAAATCCC TGAATTTGGT
GAACTGAAAG CCGAAGATGG CCAAGTGCTA CAAACTCGTT TATTCAAACC CGTTCCCTTC
GATGCGAGTA AGAAATACCC TGTCGTAGTG CGGGTTTATG GTGGGCCGCA CGCCCAGTTA
GTGACTAATA GTTGGAGCGA GCAGGACTAC TTTACCCAGT ATCTTGTGCA ACAAGGGTAT
GTGGTATTCC AATTAGATAA CCGCGGCAGT GCCCACAGAG GCACTCGGTT TGAGCAGGTG
ATTTACCGTC ACTTGGGCGA AGCTGAAGTG AATGATCAAA AAGTGGGGGT GGAGTATTTA
CGGAGTCTGC CCTTTGTCGA TGCCGATAAT GTGGCGATTT ATGGCCACAG CTACGGTGGT
TACATGGCTT TGATGAGTTT ATTTAAGGCG CCGGATTACT TTAAAGCCGC GATTTCGGGC
GCACCTGTGA CCGACTGGCG CTTGTATGAC ACCCATTATA CTGAGCGTTA TTTAGGTCAT
CCCGAAGGTA ATGAAAAGGG TTATGAAGCC AGTAGCGTGT TCCCTTACGT GAAAAACTAT
CAAGCGGGTC TATTGATGTA TCACGGCATG GCTGACGATA ACGTCTTGTT TGAAAACAGC
ACTCGAGTTT ATAAAGCGCT GCAGGATGAA GGCAAATTAT TCCAGATGAT CGATTATCCG
GGATCTAAAC ATTCGATGCG TGGCGAGAAA GTGCGTAATC ACTTATACCG CTCATTAGCG
GATTTCCTCG ATAGACAGCT GAAAAGCGCT AAGTAG
 
Protein sequence
MIKNRLTWFP ACSLKVRAVP LLIASQMAMI STAMISTTIP ALAMEGGKLP LTIERMNASP 
ALAGSSPRGL KLSPDGLRVT YLAGRKDNQS FYDLWQMDVK SGESSLLLNA DKLATNELSD
EEKARRERQR IYGEGIMEYF WADDSQALLI PASGNLYYFS LVDNSVTQLP IGEGFATDAR
LSPKGHFVSF VRDQNLYVLD LATKKLQVMT TDGGGVIKNA MAEFVAQEEM DRMTGYWWAP
DESAIAFIRI DESAVEQVTR NEIYADGIKL TEQRYPAAGK NNVDIALGVV TLKDKAINWV
SLREENSKEK SKDIYLPRVD WLPDSKHLSF QWQSRDQHQL DLQLVALDAL TKPKTLVKER
SDAWVNLNND LHFLKQQSAF IWASERDGFN HLYLFDLKGK LKTQLTKGEW AVDELEYIDE
TAGWVYFTGS KDTPIEKQLY RVPLAGGKVE RVSKQAGMHN PVFADNQSVY LDYFNSLSQP
PQISLHGDKG QQLAWVEQNA VKQGHPLYDY AGLWQIPEFG ELKAEDGQVL QTRLFKPVPF
DASKKYPVVV RVYGGPHAQL VTNSWSEQDY FTQYLVQQGY VVFQLDNRGS AHRGTRFEQV
IYRHLGEAEV NDQKVGVEYL RSLPFVDADN VAIYGHSYGG YMALMSLFKA PDYFKAAISG
APVTDWRLYD THYTERYLGH PEGNEKGYEA SSVFPYVKNY QAGLLMYHGM ADDNVLFENS
TRVYKALQDE GKLFQMIDYP GSKHSMRGEK VRNHLYRSLA DFLDRQLKSA K
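
The sequence blocks above are laid out in groups of ten letters separated by spaces, so they usually need to be joined before being passed to other tools. The following is a minimal sketch of that cleanup, with a simple GC-fraction helper; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database API, and only the first and last rows of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGATTAAAA ATAGGTTAAC TTGGTTCCCT GCGTGCTCAC TTAAGGTGCG CGCCGTCCCC"
last_row = "GATTTCCTCG ATAGACAGCT GAAAAGCGCT AAGTAG"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

print(len(first))            # number of bases in the first row
print(first[:3], last[-3:])  # first and last codon of the gene
print(gc_fraction(first))    # GC fraction of the first row only
```

Applied to the full gene sequence, the same helpers would let a reader compare the joined length and GC fraction against the Gene Length and GC content fields. The sample rows start with ATG and end with TAG, which is a stop codon under translation table 11.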