Gene Shew_1685 details


Gene Information

Locus tag: Shew_1685
Symbol:
ID: 4920448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella loihica PV-4
Kingdom: Bacteria
Replicon accession: NC_009092
Strand:
Start bp: 1935281
End bp: 1938187
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 57%
IMG OID: 640163247
Product: collagenase
Protein accession: YP_001093811
Protein GI: 127512614
COG category:
COG ID:
TIGRFAM ID:
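
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is an illustrative check only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which is stated in the record itself.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1935281, 1938187)
print(gene_len, protein_len)  # 2907 968
```

Under these assumptions the coordinates reproduce the listed 2907 bp and 968 aa.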


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000629981
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000468189
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGACAACCA ATATGACAAA TAAAACATTA GGCGGCGCTC GCCGCTTCAC CCCAACCGCG 
CTGGCCTTGG CCTGCTCGCT ACTGGCCACC CCGCTGGTGC TGCACGCTCA AGATGTTCAG
CCACAAGGCA TGCCGGCAAA GGATGTAAAG GGCCCGCTTT CGATAAAAGC CGCACCGTCG
ACAAAAGCCG CGCCGCCGAC GAAAGCCGCA CCTGCCCAGA AGGTGAAGCC GTCTGCCGAC
AAGGCCGCTG CGAGTTCTCG ATTCGACACG TCTGCGCGCA AGCCCATAGG GGCACGTGAT
GCGGAGCAGG GCGAAGGCGT GAGGCCTGAG AAGCGTGATA GCAAATTGGA GAAGGGGAGT
AAGGGGCAAT CGGCGCTGCC AGGCATGAAG GGCAATCAAG GCGCGACCGG GCCTGCGGCC
GAGGCGGATG CCTGTAATGG TTACAGCCAG TTCGACAGCT TGAGTGGTCA GGCGCTGTAC
GATTTTGTTC GCCAGGCGGA GTTTTCCTGT ATCTCCGAGC TCTATTCTCA CAACGATGCC
ACCGCGGTCA AGGTGTATCA GGCCGACAAT GTGGTGGCCG TGGCGAATCT GGCCAAGGCA
ACGGCTGCAA GTTACGATAG CAGCACCGGC GGCGAGCTGT TTAACCTGTT CTACTTCCTG
CGAGGCGCCT TCTACATAGA GTATTACAAC GATGATCTCA GCTATGGTGA CAGCCGCGCC
AGCGATGCCA TGCGTGAGCT GCTACTGGAA TATGCCAAGA ACCCGGCCTT TAACAGCCTG
AGCGACATGC AGGGCAACAC ACTGCAGGAA TATTTCATCG CCTGGGACAG CTCCTACAAC
TACTATGATT CGGTGGCCGT GATCACCGAT TATCTGAATC AGTTTAGCGA GCAGCATCTG
GCCTCCTGGT ACCACAGAAG CGCGCTGACC AAGGCGCTCA CCACCCTGTA TCGCGGCAAC
TGGGATGAAA AATATACCAA GGCGTCGATG GAGTATGATG CCCTGCGCGC CGCCTTGCTC
AAGGTGGCGA CCTCGGACTA CATCATCAAC TCAGAGTATG CTTTCGAGTC CACGGATGCC
TTTCACGAGT TTGGACGTTT CTACGAGTAT CAGGCCTACT GGAAACTACC GGATAGCCTG
AAAACGGCCC TCAACGGCGG CGTGCAGCAA TATATGGCCA AGTTTAACCG TCTGTCTAAG
CAGTGGGTGG ATGCGGCCGG TTATCTGGAT TACTACAACC CTGGCCAGTG CGACAGCTTC
GGCATCTGCG GCTGGGAGGA TGAGGTGGAA GCCAGTGTGC TTGCTATTCA CTACAGCTGC
TCGGACACCA TCAAGATCCG CGCTCAGCAG CTGACGGATC AGGAGCTGCA GGCCTCTTGC
GAGCTCATGG GTGAGGAAGA GGGGCTGTTC CATCAGATCC TGGCGACCGG CGGCGAGCCA
GTGGCGGACG ACTACAACGA AGATCTGCAG GTCAATATCT TCGATAGCTA TGACGACTAT
GATGTCTACG CCGGCATCAT CTTCGGCATC AACACTGACA ACGGCGGCAT GTACCTGGAA
GGCACGCCGT CGGATCCCGA CAACCAGGCA CGTTTTATCG CCCACGAGGC CACTTGGACC
GACGACATCT TGGTGTGGAA CCTGAGACAC GAATATGTGC ACTACCTGGA TGGTCGCTTC
AATCTTTACG GCGCCTTTAA CTACTTCGAT GTCGACACGG GTAAGTCTGT GTGGTGGGCC
GAGGGCCTCG CCGAATACAT CTCCCATCAG AATCGCTACG ATGAGGCGAT CGATATCGGC
CGCAGCCAAG AATTTAGCCT GAGCGAGATC TTGTCTAACA CCTATGACAG CGGTACCGAC
AGGGTCTATC GCTGGGGCTA TCTGGCGGTG CGCTTCTTGT TCGAGCAGCA CAGAAGCGAT
GTGGACGCTT TGTTAGTTCA CGCCCGCGCC GGTGATACCT CGGCCTGGCT CAGCTATATC
GACAACACTA TCGGCACGAA CTATGACGCA GAGTGGAACA GCTGGCTGCT GACGGTGACT
AGTAACGATA CGCTGCTCGA CGGTGGCATA GTGACCCCGG TCGACAGCGA TGGTGACGGG
GTGATCGACA GCGAAGATGC CTTCCCCAAC GATCCGACCG AGTGGGCGGA TGAGAATGGT
AACGGTATCG GCGATAACGC CGATGCGGCC AATGGTGGTG GGCAGACGGG GAACTGCGGC
GCCGCAACCA TCAGCGACGG CAATATTACC CAGGAGCAGG CCGAGTGTGT GGCAGGCACA
GGGGTGAACT ACTACTATAC CTATGTCGAG CAGGATAATA CTCAGCTCTA TATCAGCACC
ACGGGCGGTG AGGGCGATCT GGATATCTAC TTTAACCAGC AGACCTGGGC GAGCCCGAGC
GACTATGAGG TGAAGTCGCA AAACAGCGGC AACGAGGAGC TGATCAGCGT GGTCGCCAAT
CGTGGTTGGG TCTATCTCTC GACGGTAGCC GTGACGCCAT TTGAGGGCGT TAGCCTCAAG
ATCAGCCAGA GCGCCGACAC TACGCCGGAT AACGGTTCGC CTCAGGTGGC GGATGCCTGC
CTAAGCCAAA GCCCTTATAG TTATGGCGGG GTAGAGTTTG GTCAGGCGGT GTGTGTGGAC
GACGGCCATT CCAGCTATTA CTTCTATGTG CCGGCCAATA CGGCGGCTAT CGAGATCAAC
ACGGCCCATG GTAGCGGCGA TCTGAACCTC TACGCCAACG CAACGACCTG GGCGAGCCCG
ACGGCTTATC AGTTCAAGTC GGAAAACGTC GGCAACAGCG AGCAGATCCG CGTCGTTTCA
CCTGCCGAAG GCTGGTTCTA TGTCAGCGCC GATGGAGCGC CCTCAAGCAG TGGCGCCAGT
TTGCTGGTTA CCCTGGCTAG CAACTAA
 
Protein sequence
MTTNMTNKTL GGARRFTPTA LALACSLLAT PLVLHAQDVQ PQGMPAKDVK GPLSIKAAPS 
TKAAPPTKAA PAQKVKPSAD KAAASSRFDT SARKPIGARD AEQGEGVRPE KRDSKLEKGS
KGQSALPGMK GNQGATGPAA EADACNGYSQ FDSLSGQALY DFVRQAEFSC ISELYSHNDA
TAVKVYQADN VVAVANLAKA TAASYDSSTG GELFNLFYFL RGAFYIEYYN DDLSYGDSRA
SDAMRELLLE YAKNPAFNSL SDMQGNTLQE YFIAWDSSYN YYDSVAVITD YLNQFSEQHL
ASWYHRSALT KALTTLYRGN WDEKYTKASM EYDALRAALL KVATSDYIIN SEYAFESTDA
FHEFGRFYEY QAYWKLPDSL KTALNGGVQQ YMAKFNRLSK QWVDAAGYLD YYNPGQCDSF
GICGWEDEVE ASVLAIHYSC SDTIKIRAQQ LTDQELQASC ELMGEEEGLF HQILATGGEP
VADDYNEDLQ VNIFDSYDDY DVYAGIIFGI NTDNGGMYLE GTPSDPDNQA RFIAHEATWT
DDILVWNLRH EYVHYLDGRF NLYGAFNYFD VDTGKSVWWA EGLAEYISHQ NRYDEAIDIG
RSQEFSLSEI LSNTYDSGTD RVYRWGYLAV RFLFEQHRSD VDALLVHARA GDTSAWLSYI
DNTIGTNYDA EWNSWLLTVT SNDTLLDGGI VTPVDSDGDG VIDSEDAFPN DPTEWADENG
NGIGDNADAA NGGGQTGNCG AATISDGNIT QEQAECVAGT GVNYYYTYVE QDNTQLYIST
TGGEGDLDIY FNQQTWASPS DYEVKSQNSG NEELISVVAN RGWVYLSTVA VTPFEGVSLK
ISQSADTTPD NGSPQVADAC LSQSPYSYGG VEFGQAVCVD DGHSSYYFYV PANTAAIEIN
TAHGSGDLNL YANATTWASP TAYQFKSENV GNSEQIRVVS PAEGWFYVSA DGAPSSSGAS
LLVTLASN
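
Both sequences are printed in space-separated blocks of 10, so they need cleaning before any downstream use. The sketch below shows one possible way to strip the spacing, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons and drop any trailing stop codon."""
    usable = len(dna) - len(dna) % 3
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First row of the gene sequence as printed in the record.
first_row = clean("ATGACAACCA ATATGACAAA TAAAACATTA GGCGGCGCTC GCCGCTTCAC CCCAACCGCG")
print(translate(first_row))  # MTTNMTNKTLGGARRFTPTA
```

The printed peptide matches the first 20 residues of the protein sequence; applying the same functions to the full gene sequence would allow the listed protein and GC content to be checked in the same way.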