Gene Sbal223_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_0653 
Symbol 
ID7089784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp787969 
End bp791094 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content52% 
IMG OID643459566 
ProductMicrobial collagenase 
Protein accessionYP_002356596 
Protein GI217971845 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA CACTGCTCTT CGCAGCTATT AGCTTAGCTT TTGCGACTCC CGCTCTCGCC 
CATGACACTC ATACGAGTCA CCCATTAGGC AGCAACACAC CCAGCAATCC ACAAGCGCAA
CCGGCACCAC GACCACCCTC GGCGCCCATT AGCCAAATAG AATCCCATAG ACCGCTACTG
CGGGAGACGC CATTGATGGC ATCCCCGAAT GAGCCAACTA TGACAGAACA GACAATGACA
GAACCGCCTA AGATGCATTC ACGCAAGAAG CGAGAAGTAT CTGAGCAACC GCCACTTAAG
TTACAGAGTC GCAGCGCCGC ACTGCAGAGC AATGACTGCA GTGACTTTGT CGGTAAATCG
GGCCAAGCCC TAGTGGATCA ATTGAGCCAA TCGACACCCG AATGCGTCGG TAAGCTCTAT
AGCCTCAAAG GCAGTGCTGC GACCGCGCTG TTTAGCGAGG CCAATGTGAT CAGTGTAGCT
AACGCCATTG CCACTAAAGC CAAGGACTAC ACTGGGGTCG ATGTTCAGCA TCTTGAATCG
CATATTTACT TTGTCCGCGC CGCCCTCTAT GTGCAGTTTT ACAGCCCAAA TGATGTACCA
GCTTACAGCA GCGCCGCTAA GGCCAGCTTA AAATCGGCTC TCAACGCTTT ATTTGCTAAC
GCTGCGATTT GGACTGTCTC CGATGACAAT GCTGGCGTGT TGAAAGAAGC CTTAATCCTA
ATTGACTCTG CCGAGCTCGG CGCCGATTTC AATCATGTCA CCATAAAAGT GCTAACGGAC
TACGACGCTA ACTGGCAAGC CAGTTTCGCC ATGAACGCCG CAGCCAACTC AGTGTTTACC
ACCTTATTCC GCGCCCAGTG GAATGATGAC ATGCAGGCGC TGTTCGCACG TGACCAAGGC
ATTTTAGATG CGCTGAATAA CTTCCAGCTC GAACACCGCG ACTTGTTAGG CACCAATGCA
GAGTATCTGT TAGTCAACTC AGTCAAAGAG TTATCAAGAC TGTATTACAT TGATGCCATG
CGCCCGCGAG TGACTCAGCT AGTTAAAAAC ATCCTCAGCA GCACCAGCAA AAGCGAACCG
AGTAAGGTGC TTTGGTACGC GGCGGCAGAA ATGGCCGACT ATTACGATCG CAGCCATTGT
AACGATTACA ATATCTGTGG TTTTAAGGCG CAGCTAGAGG CCGATGCCCT ACCGTTCAAC
TGGAAATGCT CCGACAGTCT CAAGATCCGC GCCCAAGATC TCTATCAAGA TCAAGCCAAG
TGGGCCTGTG ATGTGCTGAC CAGCCAAGAG AGCTATTTCC ACAGCAAACT CGAAACGGGC
ATGCAACCTG TAGGGCAAGA TAACAACGAT GACTTAGAGC TAGTGATCTT TGGCAGCTCG
TCTGAGTATA AATCCTTAGC CAACAGTATC TTTGGCATCA ATACCGACAA CGGCGGCATG
TACCTTGAAG GCTCGCCCGC CGGACTCAAA AATCAGGCGC GTTTTATCGC CTATGAAGCC
GAATGGCGCA CGCCGGATTT CCATGTTTGG AACCTACAAC ACGAATATGT GCACTACCTC
GACGGGCGCT ATAACCTGTT TGGTGACTTT AGTCGCGGCA CGTCGGCCAA TACCATTTGG
TGGATTGAAG GTCTAGCGGA ATACATTTCT TACCGTGACG CCAATACCGC GGCCATTGCC
ATGGGCGAAA CCGGTGAATT TATGTTGTCG ACCATATTCA AAAACAACTA TGAATCGGGC
CAAGACCGTA TCTACCGTTG GGGTTATCTG GCGGTGCGCT TTATGTTTGA GCATCACAGG
GACGATGTAA GACAGATTCT CGCCTACCTA CGTAACGACC AATACGCGGA ATATCAAACC
TTTATGGATG GTATCGGTAC ACGTTACGAC AACGAGTGGC AAGGTTGGCT CGCCAGCGGC
TTGAGCACGG CTGACGATGG CATTGTCGAT AAAGGCCCAA GTGATGTCGA TGCTGAGCCC
AGTGGCCGTG AAGGTAACTG GGCTGGTCCT GCGGGCACTA TCAGTAAGGA TTACTCGCCT
TGCCAAGTGA CGAACGAAGC CTACCGTTAC ACAGAATCGG CCAGCCTTAA AATCAATGTG
CCGATGGAAT GTATTGATTC TAAACTCGGC CGCGCCAGTT TCAGTTTCAC TAATACCGAT
CGCTCGGCCC AAGATCTTTG GATCAAGATT GGCGGCGGTT GGGGTGATGC CGACATTTAT
TTCAATTCTA AAGGTTGGGC AAGCCCAGAA CAAAACCAAG GCGCGGGGAT CGGCAACGGT
AACTATCAAG TCATTAAAGT GCAGTTAAAC CCTGACGAAT ATTGGCACTA CATTACCTTG
TCGGGGGATT TTGGCGGCGT CGATATGCAA GTCAGCACCA CTGAGTTGTT CGCTGATGTT
GATCCCGATC TGGGAGATGG CGGCGTGGAT CCTGAGCCAC CAGCAAATTG CGGTGCAGTG
ACCTTAGATT ACGGCCAGCT CACACTGGGC AAAAACGAGT GTATCAGTGG TGGACGTAAC
AGCTTCTACT TCTGGGTTGA GGAAGATAAT ACCCAGTTCA CCGTCACCAC TAAGGGCGGC
TCTGGGGATG CCAATCTCTA CTTCAATGCC AGCCAGTGGG CCGACGCCGA TAATGCCGAT
GCCAAGAGCA CTCAAGTAGG TAATCAAGAG TCGCTAAGTT TTAGTGCCAA TCGCGGCTGG
CGTTATATCA CAGTGGACAC TGCCACTGAG TTTAGCGGCG TCACTCTCAC CCTCAACACA
GGCGCGAGCA ACACACCTAC GCCAGCTCTA ATCGCCAATG CCTGCGCGAC GCAATCGCCC
TTGAGTCATG GTGAACTGAG TTCAGGCAAA GCCATCTGCA CCGCCGATGG CCGCAGCGAT
TACTATCTTT GGGTGCCAGA AGGTACGAGT CAGTTAAGTA TCAACTCGGC CCACGGCAGT
GGTGATGTGA GCCTGTTTTC GGGGACGACT TGGGCCAATA CTCAACACTT CGATGCAGCG
TCTGTCACGC CTGCCAACAC TCAAGAAAGT ATCACAGTCG ATGCGCCAAA CGTGGGGTGG
TATTACATCA CAGTGCAGAG TGAACCACAG AGTTCAGGTG TCGCACTGCA AGTCGATTTA
CGCTAG
 
Protein sequence
MKNTLLFAAI SLAFATPALA HDTHTSHPLG SNTPSNPQAQ PAPRPPSAPI SQIESHRPLL 
RETPLMASPN EPTMTEQTMT EPPKMHSRKK REVSEQPPLK LQSRSAALQS NDCSDFVGKS
GQALVDQLSQ STPECVGKLY SLKGSAATAL FSEANVISVA NAIATKAKDY TGVDVQHLES
HIYFVRAALY VQFYSPNDVP AYSSAAKASL KSALNALFAN AAIWTVSDDN AGVLKEALIL
IDSAELGADF NHVTIKVLTD YDANWQASFA MNAAANSVFT TLFRAQWNDD MQALFARDQG
ILDALNNFQL EHRDLLGTNA EYLLVNSVKE LSRLYYIDAM RPRVTQLVKN ILSSTSKSEP
SKVLWYAAAE MADYYDRSHC NDYNICGFKA QLEADALPFN WKCSDSLKIR AQDLYQDQAK
WACDVLTSQE SYFHSKLETG MQPVGQDNND DLELVIFGSS SEYKSLANSI FGINTDNGGM
YLEGSPAGLK NQARFIAYEA EWRTPDFHVW NLQHEYVHYL DGRYNLFGDF SRGTSANTIW
WIEGLAEYIS YRDANTAAIA MGETGEFMLS TIFKNNYESG QDRIYRWGYL AVRFMFEHHR
DDVRQILAYL RNDQYAEYQT FMDGIGTRYD NEWQGWLASG LSTADDGIVD KGPSDVDAEP
SGREGNWAGP AGTISKDYSP CQVTNEAYRY TESASLKINV PMECIDSKLG RASFSFTNTD
RSAQDLWIKI GGGWGDADIY FNSKGWASPE QNQGAGIGNG NYQVIKVQLN PDEYWHYITL
SGDFGGVDMQ VSTTELFADV DPDLGDGGVD PEPPANCGAV TLDYGQLTLG KNECISGGRN
SFYFWVEEDN TQFTVTTKGG SGDANLYFNA SQWADADNAD AKSTQVGNQE SLSFSANRGW
RYITVDTATE FSGVTLTLNT GASNTPTPAL IANACATQSP LSHGELSSGK AICTADGRSD
YYLWVPEGTS QLSINSAHGS GDVSLFSGTT WANTQHFDAA SVTPANTQES ITVDAPNVGW
YYITVQSEPQ SSGVALQVDL R