Gene SO_3797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_3797 
Symbol 
ID1171441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp3948765 
End bp3950681 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content47% 
IMG OID637345567 
ProductU32 family peptidase 
Protein accessionNP_719334 
Protein GI24375291 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC CAGAGATTTA CACACCTGAG AAACTTTCTC ACGTTAATAA CCGTTTAGAG 
TTATTGGCGC CCGCTAAAAA TGCCGATTAT GGTATTGAAG CCATTCGCCA TGGTGCCGAT
GCGGTTTATA TCGGTGGTCC TGCATTTGGG GCGCGTGCGA CGGCGGGTAA CAGTGTGGAG
GATATCGCGC GCCTGTGTAC TTTTGCGCAT AAGTACCATG CTCAGGTGTT TGTCGCCCTT
AATACCATTT TGATGGACGA TGAACTGGCA GTGGCTGAAA AACTGATTTG GGACGTGTAT
AACGCTGGCG CCGATGCGCT GATTGTGCAG GATATGGGGG TTTTACAGCT CAACCTGCCG
CCGATTGCAC TGCACGCCAG TACCCAAATG GATAACCGTA ATCCCGAGAA AGTCGCCTTT
TTGGAGCAAG TGGGATTCTC CCAAGTGGTG TTAGCTCGCG AGCTTGGTTT AAGCCAAATT
CGTGAGGTTG CGGCTCACAC CAATATGCAG ATTGAGTTCT TTATCCATGG CGCGCTTTGC
GTTGCTTATA GTGGCCTATG TAACTTAAGT CATGCCTTTA GTAATCGCAG TGCTAACCGC
GGTGAATGTT CGCAAATGTG CCGCTTACCG GGTAATCTTA AGACTCGTCA AGGGGATGTG
TTAGCGAAAA ATGAGCACTT ACTTTCATTA AAAGACAATA ATCAAACCGA TAACCTTGAA
GCCCTGATCG ATGCTGGCGT ACGTTCGTTC AAAATCGAAG GCCGTTTAAA AGATTTAAGT
TACGTGAAGA ACGTGACTGC TCATTATCGC CAAAAACTCG ATGCTATTAT GGCGCGTCGC
CCAGAGTTTG TGGCTTCATC CCATGGCCGT ACTGAGCATA CTTTTACCCC AGATCCAGAA
AAAACCTTTA ACCGTGGCAG TACCGACTAC TTTGTCCATG AGCGTAGCCA AGGGATTAAA
GATTTCCGCT CGCCTAAATA TATTGGCCAA GATGTCGGTA AAGTGGTCGC TATTGGTAAG
GACTTTATTC AAGTCAGTTC AACCCACGAA TTTAATAACG GTGATGGTTT AGCTTATTTT
CCACCCAATT ATGCGATGGC AAAGCAGTCC GACGATAAGT TGCAAGGTTT ACGGGTAAAC
CGCGCTGAAG GCCATAAGCT GCATGTGTTA CAAGTCCCGC GGGATTTACG TGTTGGTATG
ACCTTATACC GTAACCATAA TCAGGCATTC GAAGCCTTGT TAGCTAAAGA GTCGGCCAAG
CGTATTATTG CTGTGGACAT GCGCTTAATC GATACGACGG CGGGTGTGGC ACTGACCTTA
ACGGATATGT ATGGCTTGAG CGCCTCGGTT GAGTTAGCGG TAGAAAAAAC GCCGGCAACC
GATGCTGAAA AAACCTTACA GACTATTCGT ACACAACTGT CAAAATTGGG TAGCACCGAC
TTTGTTGCTC GTCAAATCAG CATTGAAACT GCCGAGCCTT GGTTCCTGCC TGCATCAACG
CTCAATGGTC TTCGCCGTGA TGCGGTTGCT GCACTTGAAC TTGCCCGCAT AGAAGGCTAC
CAACGGCCAA AACCTTGGAA ATATAACCAA GATGCCGTCT ATCCATTCAA ACACTTAAGT
TACTTAGGTA ACGTGGCAAA CGAAAAGGCC AAAGATTTTT ATCAACGCCA TGGCGTGATT
GAAATTCAAG ATACCTATGA GAAAAACGGC GTCACTGAAG ACGTGCCTTT GATGGTGACT
AAGCACTGTC TGAGATTTAA CTTTAATCTT TGCCCTAAGG AAGTACCAGG TATCAAGGCT
GACCCTATGG TGCTCGAAAT TGGTAACGAT GTACTTAAGT TAGTATTTGA TTGTCCTAAG
TGCGAAATGC TTGTTGTCGG TGAAAACCGC CAGGTTCGCG GCCAAAAAGC ACTTTAA
 
Protein sequence
MSQPEIYTPE KLSHVNNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE 
DIARLCTFAH KYHAQVFVAL NTILMDDELA VAEKLIWDVY NAGADALIVQ DMGVLQLNLP
PIALHASTQM DNRNPEKVAF LEQVGFSQVV LARELGLSQI REVAAHTNMQ IEFFIHGALC
VAYSGLCNLS HAFSNRSANR GECSQMCRLP GNLKTRQGDV LAKNEHLLSL KDNNQTDNLE
ALIDAGVRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFVASSHGR TEHTFTPDPE
KTFNRGSTDY FVHERSQGIK DFRSPKYIGQ DVGKVVAIGK DFIQVSSTHE FNNGDGLAYF
PPNYAMAKQS DDKLQGLRVN RAEGHKLHVL QVPRDLRVGM TLYRNHNQAF EALLAKESAK
RIIAVDMRLI DTTAGVALTL TDMYGLSASV ELAVEKTPAT DAEKTLQTIR TQLSKLGSTD
FVARQISIET AEPWFLPAST LNGLRRDAVA ALELARIEGY QRPKPWKYNQ DAVYPFKHLS
YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMVT KHCLRFNFNL CPKEVPGIKA
DPMVLEIGND VLKLVFDCPK CEMLVVGENR QVRGQKAL