Gene SO_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_0639 
Symbol 
ID1168501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp664604 
End bp667717 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content49% 
IMG OID637342626 
Productcollagenase family protein 
Protein accessionNP_716272 
Protein GI24372230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA CTCTACTTTT CGCGGCTATT AGCTTAGCCA TTGCAACACC TTCGCTCGCC 
CATAACCATT CAGTACAAGA CCCAAACAAT TCAGTAAAAG CGAAAAACAC AGCAATCCCT
AGTACGCTAG CACCGTTAGC TTCACCGTCA GCGCCACTCA ATATTGATGT CAGCCATCGG
CCGTTATTGC CCACTGACAG TTTGATAGCC TCACCTAACG AGCCAACACA CTCTGAGTAC
CTCGATGCAC ATTCTTCACA ACAGCAAGCG TTGCTCGAGA ACTCTCCGAG ATCAAAAGTC
AGCCGCTTTG CCGCCAATGC CGACTGTAGT AACTTTGTTG GTAAATCGGG CCAAGCCCTG
TTAGATGAGC TAACGCAATC CACGCCACAG TGTGTCGGTA AACTATTTAA TCTCAAGGGA
AGTGACGCAA GCAGCGTATT TAGTGAAGCG AATGTATCCA CAGTTGCCAA TGCAATTGCA
GCCAAAGCGC CTCAATACAC AGGCGTGGAT AACCAAGGTA TTGAGTCACA TATTTACTTT
GTGCGCGCCG CGCTTTATGT GCAGTTTTAC CACCCCAGCG ACGTACCCGC CTACAGCAGC
GCGGTGAAAA ACAATTTAAA ATCAGCGCTT AATGCTCTCT TTGCCAATAA CGCCATTTGG
ACACTCTCCG ATGCCAATGC CAGCGTTCTC AAAGAAGCAC TGATCCTGAT TGATTCCGCC
GAACTGGGGG CTGATTTTAA CTTTGTCACC CTCAAAGTGC TCAATGACTA CAACAGCACA
TGGCAAGCCA GCTTTGCGAT GAATGCGGCC GCCAATACCG TGTTTACCAC CTTATTCCGC
GCCCAGTGGA ATACTGACAT GCAGGCACTG TTTGCCCGAG ATCATGCCAT TTTGGATGCG
CTCAACCAGT TCCAACTTAA CCACAGGGAC TTACTCGGCA CCAATGCTGA ATATGTTTTA
GTCAATGCTG TCAGAGAGTT ATCCAGACTG TACTACATTG ATGCCATGCG CCCTAAAGTC
ACCCAATTGG TCAAAAATAT TTTAAACAGC ACCAGCAAAA ATGATTCCAG TAGAGTGCTG
TGGTTTGCAG CTGCCGAGAT GGCAGATTAC TACGATCGCA GCAATTGTAA TGCTTACCAG
ATCTGCGGTT TCAAAGCACA ATTGGCCGCC GATAGCTTAC CGTTCAATTG GAAATGCTCC
GACAGCCTCA AGATCCGCGC CCAAGATCTT TACCAAGATC AAGCTAAATG GGCATGCGGA
GTATTAAGTC AGCAAGAAAG CCATTTCCAC ACGATACTCG AAACGGGAAT GCAGGCGGTC
GCACAGGATA ACAATGATGA TTTAGAGCTG GTGATTTTTG GCAGCTCGTC AGAATATCAA
TCCCTTGCCA ACAGCATTTT TGGGATCAAT ACCAACAATG GCGGCATGTA TTTAGAAGGC
TCTCCCGCTG GACTTAAAAA CCAAGCGCGT TTTATTGCCT ATGAGGCCGA GTGGCGACAA
CCTGATTTTC ATGTGTGGAA CTTACAGCAT GAATACGTGC ATTACCTCGA TGGCCGCTAT
AACTTGTTTG GCGACTTCAG CCGCAGCGTA TCCGCCAACA CAATTTGGTG GATTGAAGGA
TTAGCAGAAT ATATTTCATA TCGAGATGCC AACCCCGCTG CCATCGCCAT GGGTGAAACC
GGTGAGTTTA TGCTCTCAAC CATCTTTAAA AACACCTATG ACTCTGGCCA AGACCGCATT
TACCGTTGGG GTTACCTCGC CGTACGGTTC ATGTTTGAAA ACCACCGCGA TGATGTTCGC
CAGATCCTAA CCTTCCTGCG TAATAACCAA TATGCTGAAT ATCAAGCATT TATGGATTCC
ATTGGCACCC GCTATGACAA CGAATGGCGC GGCTGGCTCA CCAGCGGTTT AAGCACTACC
AACAATGGTA TTGTCGATAA AGGCCCAAGT GATGAACAGG CTAATGCCAG TGGTCGCGAA
GGCAACTGGG CAGGCCCCGC GGGCACCATC AGCAAAGATT ACTCGCCCTG CCAAGTCAGT
AACGAAGCAT ACCGCTACAG TGAGTCTGCC AGTCTCAGTC TTGAGGTACC CATGGAGTGT
ATTGATGCCA AACAAGGCCG AGCCAGCTTT AGCTTTGCCA ATAGTGACCG CTCGGCCCAA
GATATTTGGA TCAAAATTGG CGGTGGTTGG GGAGATGCCG ATATTTATTA CGATTCAAGG
GGATGGGCAA GTGCTGAAAA AAATCAGGGC TATGGCATCG GCAACGGTAA TTACCAAGTG
ATCAAAGTGA GCCTAAACCC CAATGAACTT TGGCACTATA TCACCCTAGA AGGGGATTTT
GGTGGTGTTG ATATGCTGGT CAGTACCAGT GAGTTAGTTG CCGATACCGA TCCCGACTTA
GGTGATGGCG ATACTGGCGG TGAAGTGCCC AGCAACTGTG GTGCCGCAAC CATCAATTAC
GGCAAGCTAA CCCTCGGAAA AGATGAATGT ATCAGCGGTG GTCGTAATAG CTTTTATTTC
TGGGTCGATG CTGACAACAG CCAATTTAGT GTCAGCACAA CCGGGGGAAC AGGCGATGCC
AATATTTACT TTAATGCCAA TACTTGGGCA AGTGCGAGCA ATGCCCAAGC CAGCAGTGTA
AATCAAGGCA ACAAAGAGTC CTTTAGTTTT ACGGCAAACC GTGGCTGGCG ATATATCACG
GTTGATACCG CGAGTGAATT TAGTGGCGTG ACCTTCAACC TTAAAGCCGG TGGTGGTGGC
AGTAGCGTTC CTAATCAAAT TGCCAATGCC TGTGCGACTA AATCACCCGT GAGCTATACC
CAACTCACGC CGGGTGATGC CGTCTGCAGC GCCAATGGCC GTAATGACTA TTATCTTTGG
ATTCCAGAAG GGACAAGCCA ACTAGAAGTA CGCTCGGCCC ACGGAACTGG AGATGTCAGC
CTTTACTCGG GTCGCAGCTG GGCCAATGCC CAGCAATATG AAGCCGCCTC AACAAACGCT
GGCAGCACTA AGGAACAAAT CAAGGTCAAT AATCCCAGTG CTGGCTGGTA TTACATTACG
CTCCAAAGTG AAGGCCAAAG TGCAGGTGTG GCCCTTCAGG TAGATTTACG CTAA
 
Protein sequence
MKQTLLFAAI SLAIATPSLA HNHSVQDPNN SVKAKNTAIP STLAPLASPS APLNIDVSHR 
PLLPTDSLIA SPNEPTHSEY LDAHSSQQQA LLENSPRSKV SRFAANADCS NFVGKSGQAL
LDELTQSTPQ CVGKLFNLKG SDASSVFSEA NVSTVANAIA AKAPQYTGVD NQGIESHIYF
VRAALYVQFY HPSDVPAYSS AVKNNLKSAL NALFANNAIW TLSDANASVL KEALILIDSA
ELGADFNFVT LKVLNDYNST WQASFAMNAA ANTVFTTLFR AQWNTDMQAL FARDHAILDA
LNQFQLNHRD LLGTNAEYVL VNAVRELSRL YYIDAMRPKV TQLVKNILNS TSKNDSSRVL
WFAAAEMADY YDRSNCNAYQ ICGFKAQLAA DSLPFNWKCS DSLKIRAQDL YQDQAKWACG
VLSQQESHFH TILETGMQAV AQDNNDDLEL VIFGSSSEYQ SLANSIFGIN TNNGGMYLEG
SPAGLKNQAR FIAYEAEWRQ PDFHVWNLQH EYVHYLDGRY NLFGDFSRSV SANTIWWIEG
LAEYISYRDA NPAAIAMGET GEFMLSTIFK NTYDSGQDRI YRWGYLAVRF MFENHRDDVR
QILTFLRNNQ YAEYQAFMDS IGTRYDNEWR GWLTSGLSTT NNGIVDKGPS DEQANASGRE
GNWAGPAGTI SKDYSPCQVS NEAYRYSESA SLSLEVPMEC IDAKQGRASF SFANSDRSAQ
DIWIKIGGGW GDADIYYDSR GWASAEKNQG YGIGNGNYQV IKVSLNPNEL WHYITLEGDF
GGVDMLVSTS ELVADTDPDL GDGDTGGEVP SNCGAATINY GKLTLGKDEC ISGGRNSFYF
WVDADNSQFS VSTTGGTGDA NIYFNANTWA SASNAQASSV NQGNKESFSF TANRGWRYIT
VDTASEFSGV TFNLKAGGGG SSVPNQIANA CATKSPVSYT QLTPGDAVCS ANGRNDYYLW
IPEGTSQLEV RSAHGTGDVS LYSGRSWANA QQYEAASTNA GSTKEQIKVN NPSAGWYYIT
LQSEGQSAGV ALQVDLR