Gene Sbal223_2556 details


Gene Information

Locus tag: Sbal223_2556
Symbol:
ID: 7086122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 3038841
End bp: 3040517
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 50%
IMG OID: 643461448
Product: peptidase M28
Protein accession: YP_002358472
Protein GI: 217973721
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
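
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function name `cds_lengths` is hypothetical and is not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1  # inclusive range
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # the stop codon encodes no residue
    return gene_length_bp, protein_length_aa


# Coordinates copied from the record above.
gene_bp, protein_aa = cds_lengths(3038841, 3040517)
print(gene_bp, protein_aa)
```

With the values from this record, the result agrees with the listed gene length (1677 bp) and protein length (558 aa).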


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00050336
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000133142
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCGACGTC TGTATCCACT CTGTGCGCTG GTTATGCTCA GTGCCTGTAG TCCGAATGCA 
TCACAGCCTG ATAGCAACAG CACCTTGGCT GAACTCAAAC CAGTGCAGTT TAATGAAGCA
CGCTTTAGAA ATGATATTAA AGCCCTGTCA TCCGATGAAT TTGAAGGTCG TGCGCCGACC
ACTCACGGTG AAAAACTCAC CTTGGACTAT TTGACTAAAG CCTTTACAGA GATGGGACTC
AAGGGCGCGT ACCAAGGCAG CTTTTTGCAA CCTGTGCCTA TGGTGAGCTA TACCGCAGAT
GAAGCGCAGC AAGTTACGCT GGCAGGATTA CCCTTTAAAT ACCGAAAAGA TCTAGTGCTC
AGCAGTCGCC ACGATAACGG TGGTATTAGT ATTGAGAACG CACCATTAGT CTTCGTCGGC
TATGGGGTGA ACGCACCCGA ATATGGTTGG AATGATTACC AAGATCTCGA CATGAAAGGC
AAAATCGCGG TGATCTTGGT GAACGATCCC GGCTTTGCTC GCCCTGACTC GGGTAAATTC
AATGGCAAAG CCATGACCTA TTACGGCCGC TGGAGCTATA AGTTTGAAGA GGCCAGCCGT
CAAGGCGCCC TTGGTGCACT GATCATCCAT GATACTGAGC CCGCCTCTTA TCCTTGGTCT
GTGGTAGAAA ATAGCTGGAC TGGGCCACAA CAAGATCTAG TGCTGAGTAA AGCAGAACAA
GATAGTCGCA TTCAAGTCGA AGGCTGGCTT ACATTAGACG CAGCAACCCA ACTTTTTGAT
AAATCTGGTT TATCACTGCC GAGCTTGATG GCGCGCGCTG CCGACAGCCC AATCAATGTC
CCCCTTGAGC AAACGGCCAA CATCGCCTTT AAAAATAAAG CCGAATACGC GAATAGCTAC
AACGTGGTCG CCACACTCCC CGGCAGCACA CAAGCCGACG AGCAGATCCT CTACACCGCC
CATTGGGATC ACATAGGTAA AGACGAGACG AAAGCAGGCG ATCAGATTTA CAACGGCGCC
ATGGACAATG CCTCAGGCAC TGCCGGTATT CTCGAAATCG CCAGACAATT AGCCGACAAT
GCGAAACAAG GCCACGGTTT AGCGCGCTCG GTCACCTTTA TTGCCACCAC AGGTGAAGAA
CAGGGCTTAC TCGGTTCACG CTATTACGCC GCTAATCCAC TTTATCCTAT CGACAAAACC
GTGGCAGTAT TAAACCTCGA CAGCACCAAT ATCTACGGCA AAACCAAGGA TTTTACGATT
GTCGGCAAAG GCAAGTCTGA GTTAGAAACT TACCTTATCG ATGCGGCGAA ACAACAGAAC
CGCATCGCCA TGGGTGAGAA AAATCCCGCT TCGGGCGGTT TCTTCCGTTC AGATCATTTC
AGCTTTGCCA AACTGGGCGT CCCAGCGGTA TTTGCCAGCG GCGGTAGCGA CCCCGTTGAT
GAAGCCACGG CTGCTTATAA AACCCAGATG CAAGCCACAA TGAAAGGCTG TTATCACAAT
GTCTGCGATG AATACCATGA AGACTGGGAT TTGAGTGGCG CGCTGCAAGA TTTGCAAGTC
TACTACCAAG TCACACGCAC ATTAGGCAGC AGCAAAGATT GGCCGGGATA CTATCAAGGC
ACAGAGTTTA ATAGCCTGCG CCCAGCTAAA ACGACGCTGG TGACAGATGC AAAATAG
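
A GC content figure like the one in this record can be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in as plain text in blocks of ten bases separated by whitespace. The helper name `gc_fraction` is hypothetical, and the database may round or count differently.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces, newlines) between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage on the first block-formatted line of the gene sequence above.
first_line = "ATGCGACGTC TGTATCCACT CTGTGCGCTG GTTATGCTCA GTGCCTGTAG TCCGAATGCA"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of a single line gives the value for the whole gene.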
 
Protein sequence
MRRLYPLCAL VMLSACSPNA SQPDSNSTLA ELKPVQFNEA RFRNDIKALS SDEFEGRAPT 
THGEKLTLDY LTKAFTEMGL KGAYQGSFLQ PVPMVSYTAD EAQQVTLAGL PFKYRKDLVL
SSRHDNGGIS IENAPLVFVG YGVNAPEYGW NDYQDLDMKG KIAVILVNDP GFARPDSGKF
NGKAMTYYGR WSYKFEEASR QGALGALIIH DTEPASYPWS VVENSWTGPQ QDLVLSKAEQ
DSRIQVEGWL TLDAATQLFD KSGLSLPSLM ARAADSPINV PLEQTANIAF KNKAEYANSY
NVVATLPGST QADEQILYTA HWDHIGKDET KAGDQIYNGA MDNASGTAGI LEIARQLADN
AKQGHGLARS VTFIATTGEE QGLLGSRYYA ANPLYPIDKT VAVLNLDSTN IYGKTKDFTI
VGKGKSELET YLIDAAKQQN RIAMGEKNPA SGGFFRSDHF SFAKLGVPAV FASGGSDPVD
EATAAYKTQM QATMKGCYHN VCDEYHEDWD LSGALQDLQV YYQVTRTLGS SKDWPGYYQG
TEFNSLRPAK TTLVTDAK
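
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, self-contained translator and not the database's own method. It builds the codon table from the conventional TCAG ordering, which gives the standard amino acid assignments. Translation table 11 uses these same assignments and differs only in which codons may act as starts. Because this gene begins with ATG, no special handling of alternative start codons is included, which is a simplifying assumption. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order (first base varies slowest), paired with
# the standard amino acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Trailing bases that do
    not form a complete codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first seven codons of the gene sequence above.
print(translate_cds("ATGCGACGTC TGTATCCACT C"))  # MRRLYPL
```

Translating the full gene sequence and comparing the result with the protein sequence, after removing the spaces between blocks, is a quick consistency check on the record.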