Gene Sbal_3732 details

Gene Information

Locus tag: Sbal_3732
Symbol:
ID: 4843317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS155
Kingdom: Bacteria
Replicon accession: NC_009052
Strand:
Start bp: 4368279
End bp: 4371404
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 51%
IMG OID: 640120999
Product: collagenase
Protein accession: YP_001052075
Protein GI: 126175926
COG category:
COG ID:
TIGRFAM ID:
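The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4368279, 4371404)
    print(length, protein_length(length))
```

Under these assumptions the values reported in the record (3126 bp and 1041 aa) are consistent with the listed coordinates.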


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAACA CACTGCTCTT CGCAGCTATT AGCTTAGCTT TTGCGACTCC CGTTCTCGCC 
CATGACACTC ATACGAGTCA CCCATTAGGC AGCAACACAC CCAGTAATCC ACAAGCGCAA
CCGGCACCAC GACCACCCTC GGCGCCCATT AGCCAAATAG AATCCCATAG ACCGCTACTG
CGGGAGACGC CATTGATGGC ATCCCCGAAT GAGCCAACTA TGACAGAACA GACAATGACA
GAACCGCCTA AGATGCATTC ACGCAAGAAG CGAGAAGTAT CTGAGCAACC GCCACTTAAG
TTACAGAGTC GCAGCGCCGC ACTGCAGAGC AATGACTGCA GTGACTTTGT CGGTAAATCG
GGCCAAGACC TAGTGGATCA ATTGAGCCAA TCGACACCCG AATGCGTCGG TAAGCTCTAT
AGCCTCAAAG GCAGTGCTGC GACCGCGCTA TTTAGCGAGG CCAATGTGAT CAGTGTCGCC
AACGCCATTG CCACTAAAGC CAAGGACTAC ACTGGGGTCG ATGTTCAGCA TCTTGAATCG
CATATTTACT TTGTCCGCGC CGCCCTCTAT GTGCAGTTTT ACAGCCCAAA TGATGTACCC
GCTTACAGCA GCGCCGCTAA GGCCAGCTTA AAATCGGCTC TTAACGCTTT ATTTGCTAAC
GCTGCGATTT GGACTGTCTC CGATGACAAT GCTGGCGTGT TGAAAGAAGC CTTAATCCTG
ATTGACTCTG CCGAGCTCGG CGCCGATTTC AATCATGTCA CCATAAAAGT GCTAACGGAC
TACGACGCTA ACTGGCAAGC CAGTTTCGCC ATGAACGCCG CAGCCAACTC AGTGTTTACC
ACCTTATTCC GCGCCCAGTG GAATGATGAC ATGCAGGCGC TGTTCGCACG TGACCAAGGC
ATTTTAGATG CGCTGAATAA CTTCCAGCTC GAACACCGCG ACTTGTTAGG CACCAATGCA
GAGTATCTGT TAGTCAACTC AGTCAAAGAG TTATCAAGAC TGTATTACAT TGATGCCATG
CGCCCGCGAG TGACTCAGCT AGTTAAAAAC ATCCTCAGCA GCACCAGCAA AACCGAACCG
AGCAAAGTGC TTTGGTACGC GGCGGCAGAA ATGGCCGACT ATTACGATCG CAGCCATTGT
AACGATTACA ATATCTGTGG TTTTAAGGCG CAGTTAGAGG CCGATGCCTT ACCTTTCAAC
TGGAAATGCT CCGACAGTCT CAAGATCCGC GCCCAAGATC TCTATCAAGA TCAAGCCAAG
TGGGCCTGTG ATGTGCTGAC CAGCCAAGAA AGCTATTTCC ACAGTAAACT TGAAACGGGT
ATGCAACCTG TAGGGCAAGA TAACAACGAT GACTTAGAGC TAGTGATATT TGGCAGCTCG
TCTGAGTATA AATCCTTAGC CAACAGTATC TTTGGTATCA ATACCGACAA CGGCGGCATG
TACCTTGAAG GCTCGCCCGC CGGACTCAAA AATCAGGCGC GTTTTATCGC CTATGAAGCC
GAATGGCGCA CGCCGGATTT CCATGTCTGG AACCTGCAAC ACGAATATGT GCACTACCTC
GACGGGCGCT ATAACCTGTT TGGTGACTTT AGCCGCGGCA CGTCAGCCAA TACCATTTGG
TGGATTGAAG GCCTCGCAGA ATACATTTCT TACCGTGACG CCAATACCGC GGCGATTGCC
ATGGGCGAAA CCGGTGAGTT TATGTTGTCG ACCATATTCA AAAACAACTA TGAATCGGGC
CAAGACCGTA TCTACCGTTG GGGTTATCTG GCGGTGCGCT TTATGTTTGA GCATCACAGG
GACGATGTAA GACAGATTCT CGCCTACCTA CGTAACGACC AATACGCTGA ATATCAAACC
TTTATGGATG GTATTGGCAC ACGTTACGAC AACGAGTGGC AAGGTTGGCT CGCCAGCGGC
TTGAGCACGG CTGACGATGG TATAGTCGAT AAAGGCCCAA GTGATGTAGA TGCTGAGCCC
AGTGGCCGTG AAGGTAATTG GACTGGCCCT GCGGGCACTA TCAGTAAGGA TTACTCCCCT
TGCCAAGTGA CGAACGAAGC CTACCGTTAC ACTGAATCGG CCAGCCTGAA AATCGATGTG
CCGATGGAAT GCATCGACTC CAAACTCGGT CGCGCCAGTT TTAGCTTCAC TAATACCGAG
CGCTCGGCCC AAGATCTTTG GATCAAGATT GGCGGCGGTT GGGGTGATGC CGACATTTAT
TTCAATTCTA AAAGTTGGGC AAGCCCAGAA CAAAACCAAG GCGCGGGGAT CGGCAACGGT
AACTATCAAG TCATTAAAGT GCAGTTAAAC CCTGACGAAT ATTGGCACTA CATTACCTTG
TCGGGGGATT TTGGCGGCGT CGATATGCAA GTCAGCACCA CTGAGCTGTT CGCTGATGTT
GATCCCGATC TGGGAGATGG CGGCGTGGAT CCTGAGCCAC CAGCAAATTG CGGTGCAGTG
ACCTTAGATT ACGGCCAGCT CACACTGGGC AAAAACGAGT GTATCAGTGG TGGACGCAAC
AGCTTCTACT TCTGGGTTGA GGAAGATAAT ACTCAGTTCA CCGTCACCAC TAAGGGTGGC
TCTGGGGATG CCAATCTCTA CTTCAATGCC AGTCAGTGGG CCGACGCCGA TAATGCCGAT
GCCAAGAGCA CTCAAGTGGG TAATCAAGAG TCGCTAAGTT TTAGTGCCAA TCGCGGCTGG
CGTTATATCA CAGTGGATAC TGCCACTGAG TTTAGCGGCG TCACACTCAC CCTCAATACA
GGCGCGAGCA ATACGCCTAC GCCAGCTCTA ATCGCTAATG CCTGCACGAC GCAATCGCCC
TTGAGTCATG GTGAGCTGAG TTCAGGCAAA GCCATCTGCA CCGCCGATGG CCGCAGCGAT
TACTATCTTT GGGTGCCAGA AGGTACGAGT CAGTTAAGCA TCAACTCGGC CCATGGCAGT
GGTGATGTGA GCCTGTTTTC GGGGACAACT TGGGCCAATG CTCAACACTT CGATGCAGCG
TCTGTCACGC CCGCTAACAC TCAAGAAAGT ATCACAGTCG ATGCACCAAG CCTGGGTTGG
TATTACATCA CAGTGCAGAG TGAACCCCAG AGTTCAGGAG TCGCACTGCA GGTTGATTTA
CGCTAG
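
The gene sequence above is displayed in blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First display line of the gene sequence, as an example input.
    first_line = "ATGAAAAACA CACTGCTCTT CGCAGCTATT AGCTTAGCTT TTGCGACTCC CGTTCTCGCC"
    seq = clean_sequence(first_line)
    print(len(seq), round(100 * gc_content(seq)))
```

Passing the full sequence text through `clean_sequence` should yield a string whose length can be compared with the Gene Length field.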
 
Protein sequence
MKNTLLFAAI SLAFATPVLA HDTHTSHPLG SNTPSNPQAQ PAPRPPSAPI SQIESHRPLL 
RETPLMASPN EPTMTEQTMT EPPKMHSRKK REVSEQPPLK LQSRSAALQS NDCSDFVGKS
GQDLVDQLSQ STPECVGKLY SLKGSAATAL FSEANVISVA NAIATKAKDY TGVDVQHLES
HIYFVRAALY VQFYSPNDVP AYSSAAKASL KSALNALFAN AAIWTVSDDN AGVLKEALIL
IDSAELGADF NHVTIKVLTD YDANWQASFA MNAAANSVFT TLFRAQWNDD MQALFARDQG
ILDALNNFQL EHRDLLGTNA EYLLVNSVKE LSRLYYIDAM RPRVTQLVKN ILSSTSKTEP
SKVLWYAAAE MADYYDRSHC NDYNICGFKA QLEADALPFN WKCSDSLKIR AQDLYQDQAK
WACDVLTSQE SYFHSKLETG MQPVGQDNND DLELVIFGSS SEYKSLANSI FGINTDNGGM
YLEGSPAGLK NQARFIAYEA EWRTPDFHVW NLQHEYVHYL DGRYNLFGDF SRGTSANTIW
WIEGLAEYIS YRDANTAAIA MGETGEFMLS TIFKNNYESG QDRIYRWGYL AVRFMFEHHR
DDVRQILAYL RNDQYAEYQT FMDGIGTRYD NEWQGWLASG LSTADDGIVD KGPSDVDAEP
SGREGNWTGP AGTISKDYSP CQVTNEAYRY TESASLKIDV PMECIDSKLG RASFSFTNTE
RSAQDLWIKI GGGWGDADIY FNSKSWASPE QNQGAGIGNG NYQVIKVQLN PDEYWHYITL
SGDFGGVDMQ VSTTELFADV DPDLGDGGVD PEPPANCGAV TLDYGQLTLG KNECISGGRN
SFYFWVEEDN TQFTVTTKGG SGDANLYFNA SQWADADNAD AKSTQVGNQE SLSFSANRGW
RYITVDTATE FSGVTLTLNT GASNTPTPAL IANACTTQSP LSHGELSSGK AICTADGRSD
YYLWVPEGTS QLSINSAHGS GDVSLFSGTT WANAQHFDAA SVTPANTQES ITVDAPSLGW
YYITVQSEPQ SSGVALQVDL R
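
The protein sequence can be related back to the gene sequence by codon translation. The sketch below builds a codon table in the conventional TCAG ordering; the amino acid assignments of translation table 11 match those of the standard code (the two differ in permitted start codons, which this sketch does not model). The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned, upper-case CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, with the display spacing removed.
    print(translate("ATGAAAAACACACTGCTCTTCGCAGCTATT"))
```

For the first 30 bases of the gene sequence this yields MKNTLLFAAI, matching the first block of the protein sequence shown above.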