Gene Sbal223_2259 details

Gene Information

Locus tag: Sbal223_2259
Symbol:
ID: 7086386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2681461
End bp: 2683326
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 48%
IMG OID: 643461157
Product: Peptidyl-dipeptidase A
Protein accession: YP_002358181
Protein GI: 217973430
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID:
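
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2681461
end_bp = 2683326
gene_length_bp = 1866
protein_length_aa = 621

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = span_bp // 3
coding_codons = codon_count - 1

print(span_bp)        # 1866
print(span_bp % 3)    # 0
print(coding_codons)  # 621
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.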


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00205408
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTATTT CATTGAATCG TCGCAGTAAA ATCGCCCTCA CAGTGGCCTT AACGCTGGGA 
TTAACGGCCT GTAATGATGC ACAAAGTAAA ACCGACAAGC CTGCTAGCAC TGAAGCGGCT
GTCGTCAATA CCGCCCCAGA TAAAACACAA GCCATTGCTT TTATTAACGA TGCTGAAGCC
AAAATGGCTG AATTGTCGAT TGAGTCTAAT CGCGCCGAAT GGATTTACAG CAACTTTATT
ACCGATGATA CCGCAGCCCT GTCTGCCGCG GTCGGTGAAA AGGTCAGCGC AGCGTCGGTA
AAATTTGCAA CAGAGGCGGC TAAGTACGCC AATGTGCAGC TCGATCCCGT TAATGCCCGC
AAACTCAATA TATTACGCAG TGCGTTAGTA TTGCCCGCCC CGCTCGATCC TGCGAAAAAT
GCCGAGCTTG CGCAAATTAG CTCAGAGTTA AATGGCTTAT ACGGCAAAGG CAAATACTGT
TTTGCCGATG GCAAGTGTAT GACCCAGCCT GAGCTATCGA GCTTGATGGC CGAATCACGG
GATCCCGCTA CGTTGCTGGA AGCGTGGAAA GGCTGGCGTG AAATTGCCAA ACCTATGCGT
CCCTTGTTTC AACGTGAAGT GGAACTGGCC AATGAAGGCG CGAAGGATCT TGGTTATGCA
AACCTGTCTG AGCTATGGCG TAGTCAATAT GATATGAAAC CCGATGATTT TTCACAGGAA
CTCGATCGTC TTTGGGGCCA AGTGAAACCC CTTTATGAAT CATTGCACTG TTACGTGCGC
GGTGAACTCA ATAAAGAATA CGGCGATACC GTTGCACCGA CTACAGGACC TATCCCAGCA
CATTTACTCG GCAATATGTG GGCCCAGCAA TGGGGAAATG TGTATGATTT AGTCGCCCCA
GACAATGCCG ACCCGGGTTA CGATGTCACT GAGCTACTGG CAAAAAATGG CTATGACGAG
CATAAAATGG TGAAACAAGC CGAAGGCTTC TTCACGTCTC TAGGATTTGC GCCATTGCCA
GAAAGTTTTT GGGCACGTTC TCTATTTGTT CAGCCAAAGG ATCGTGATGT GGTTTGCCAC
GCTTCGGCAT GGGATCTCGA TAATCTCGAC GATATTCGCA TAAAAATGTG TATCCAAAAG
ACCGCCGAAG ATTTTAGCGT GATCCATCAC GAACTTGGAC ATAACTTCTA TCAACGCGCT
TATAAGAAGC AACCATTCCT GTTTAAAAAC AGCGCCAACG ATGGTTTCCA TGAAGCGATT
GGTGACACGA TTGCGCTGTC GATCACCCCA AGCTACTTAA AGCAGATTGG CTTATTAGAT
GAAGTGCCTG ATGCCTCTAA GGACATTGGC CTCTTACTGA AGCAAGCTTT AGATAAAATT
GCCTTCTTGC CCTTTGGTCT GATGATAGAT CAGTGGCGCT GGAAAGTGTT TAGTGGTGAA
ATCACGCCTG CCCAATATAA CCAAGCATGG TGGGATCTCA GGGAAAAATA CCAAGGCGTA
AAAGCACCGA CGAAGCGCAG CGAAGCTGAC TTTGATCCAG GCGCTAAATA CCATGTGCCA
GGTAATGTGC CTTACACCCG CTACTTCCTC GCGCATATTC TGCAATTCCA GTTCCATCAA
GCGCTATGTG AAACTGCGGG CGATAAAGGT CCGGTTCATA GATGCAGTAT TTATGGCAAT
CAAGCTGCGG GAGAGAAACT CAATAAGATG CTCGAGTTAG GCTTAAGTAA GCCATGGCCT
GAAGCATTGA AAGAAGTCAC CGGCAAAGAA ACCATGGATG CCAAAGCTGT GCTCGATTAC
TTCGCACCGC TTAAAACATG GTTAGATGAG CAAAATACCA CCGCAAACCG CCAGTGTGGT
TGGTAA
 
Protein sequence
MVISLNRRSK IALTVALTLG LTACNDAQSK TDKPASTEAA VVNTAPDKTQ AIAFINDAEA 
KMAELSIESN RAEWIYSNFI TDDTAALSAA VGEKVSAASV KFATEAAKYA NVQLDPVNAR
KLNILRSALV LPAPLDPAKN AELAQISSEL NGLYGKGKYC FADGKCMTQP ELSSLMAESR
DPATLLEAWK GWREIAKPMR PLFQREVELA NEGAKDLGYA NLSELWRSQY DMKPDDFSQE
LDRLWGQVKP LYESLHCYVR GELNKEYGDT VAPTTGPIPA HLLGNMWAQQ WGNVYDLVAP
DNADPGYDVT ELLAKNGYDE HKMVKQAEGF FTSLGFAPLP ESFWARSLFV QPKDRDVVCH
ASAWDLDNLD DIRIKMCIQK TAEDFSVIHH ELGHNFYQRA YKKQPFLFKN SANDGFHEAI
GDTIALSITP SYLKQIGLLD EVPDASKDIG LLLKQALDKI AFLPFGLMID QWRWKVFSGE
ITPAQYNQAW WDLREKYQGV KAPTKRSEAD FDPGAKYHVP GNVPYTRYFL AHILQFQFHQ
ALCETAGDKG PVHRCSIYGN QAAGEKLNKM LELGLSKPWP EALKEVTGKE TMDAKAVLDY
FAPLKTWLDE QNTTANRQCG W
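
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence might be joined, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard one, whose amino-acid assignments translation table 11 shares; the alternative start codons that table 11 allows are not handled, which is sufficient here only because this gene starts with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the
# same amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Join the ten-character blocks into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGGTTATTT CATTGAATCG TCGCAGTAAA ATCGCCCTCA CAGTGGCCTT AACGCTGGGA"
print(translate(clean(first_line)))  # MVISLNRRSKIALTVALTLG
```

The printed residues match the first two blocks of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content reported in the Gene Information fields.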