Gene Shewmr4_3138 details


Gene Information

Locus tag: Shewmr4_3138
Symbol:
ID: 4253709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3757029
End bp: 3758945
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 49%
IMG OID: 638119780
Product: peptidase U32
Protein accession: YP_735266
Protein GI: 113971473
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
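
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3757029
end_bp = 3758945
gene_length_bp = 1917
protein_length_aa = 638

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so N codons encode N - 1 residues.
codons, remainder = divmod(span_bp, 3)
encoded_residues = codons - 1

print("span (bp):", span_bp)
print("codons:", codons, "remainder:", remainder)
print("encoded residues:", encoded_residues)
```

Under these assumptions the computed span should agree with the listed gene length, and the codon count minus one with the listed protein length.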


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGC CAGAGATTTA CACACCTGAG ACACTTTCTC ACGTTAATAA CCGCTTAGAG 
TTATTGGCGC CTGCAAAAAA TGCCGATTAT GGCATTGAAG CCATTCGCCA TGGTGCCGAC
GCGGTTTATA TCGGTGGCCC CGCATTCGGC GCGCGTGCGA CCGCGGGTAA CAGTGTGGAA
GATATCGCCC GTCTGTGTAC TTTTGCTCAT AAGTATCATG CGCAAGTGTT TGTGGCTATT
AACACTATTC TGATGGACGA TGAGCTCGAA ACCGCTGAAA AGCTGATTTG GGATGTGTAT
AACGCGGGCG CCGATGCACT GATTGTGCAG GACATGGGCG TATTGCAACT TAACCTGCCG
CCGATTGCGC TACATGCTAG TACGCAAATG GATAACCGTA ATCCCGAGAA AGTCGCCTTC
TTAGAGCAAG TGGGTTTCTC ACAAGTAGTA TTGGCGCGTG AGTTAGGCTT AAGCCAAATC
CGTGAGGTTG CGGCTCACAC CAATATGCAG ATTGAGTTCT TTATTCACGG CGCCCTGTGT
GTGGCCTACA GTGGATTATG TAACTTAAGT CATGCCTTCA GTAACCGCAG TGCAAACCGT
GGCGAATGTT CGCAAATGTG TCGTCTGCCG GGCAATCTTA AGACCCGCCA AGGGGATGTG
TTGGCGCAAA ATGAGCACTT ACTCTCATTA AAAGACAATA ACCAAACCGA TAACCTCGAA
GCCTTGATTG ATGCCGGCGT TCGCTCCTTC AAAATTGAGG GGCGTTTAAA GGACTTAAGT
TATGTTAAAA ACGTGACCGC CCATTATCGT CAAAAGCTCG ATGCCATTAT GGCGCGTCGC
CCTGAGTTTG TAGCGTCATC CCATGGCCGT ACTGAACATA CCTTTACTCC GGATCCTGAA
AAAACCTTTA ACCGTGGCAG CACAGATTAC TTTGTGAATG AGCGTAGCCA AGGGATTAAA
GACTTCCGCT CGCCAAAATA TATCGGGCAA GATGTGGGTA AAGTGGTCGC CATCGGTAAA
GACTTTATTC AAGTCAGTTC AACCCACGAG TTTAATAACG GCGATGGTTT AGCGTATTTC
CCGCCAAACT ATGCGATGGC CAAACAGTCC GATGACAAAT TGCAGGGACT GCGTGTTAAC
CGTGCCGAAG GTCATAAGCT GCATGTATTG CAGGTGCCGC GCGATCTGCG TATCGGTATG
ACCTTATACC GTAACCATAA CCAAGCTTTC GAGACGTTAC TTTCTAAGGA GTCAGCCAAG
CGTATTATCG TCGTCGACAT GTGTTTAACC GATACTGCTA CGGGCGTGGC GCTGACCTTA
ACGGATATTT ACGGCCTCAG TGCAACGGTT GAGCTTGCAG TCGAAAAAAC GCCCGCCACC
GACGCTGAAA AAACCTTGCA GACTATCCGT ACTCAATTGT CGAAACTTGG TAGTACCGAT
TTTACAGCGC GCCAGATTAG TATCGAAACC GTCGAGCCTT GGTTCCTGCC TGCATCTGTG
CTCAACGGCC TGCGCCGTGA TGCGGTCGCA GCATTAGAAC TTGCCCGTGT TGAAGGTTAC
CAGCGTCCAA AACCTTGGAA ATATAATCAA GATGCCGTGT ATCCATTCAA ACACTTAAGT
TACTTGGGTA ACGTGGCAAA CGAAAAGGCG AAGGACTTTT ACCAACGCCA TGGCGTGATT
GAAATTCAGG ACACCTACGA GAAAAATGGC GTGACCGAAG ATGTGCCGTT AATGATCACT
AAGCATTGCC TGCGATTTAA CTTTAATCTC TGTCCTAAGG AAGTGCCGGG CATTAAGGCG
GATCCTATGG TGCTTGAGAT AGGTAACGAT GTGTTGAAGT TGGTCTTCGA CTGTCCAAAA
TGCGAGATGA TGGTCGTCGG TGAAAACCGC CAGGTTCGCG GTCAAAAAGC CGTTTAA
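
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or base composition can be computed. The following is a minimal sketch of that clean-up and of a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAGCCAGC CAGAGATTTA CACACCTGAG ACACTTTCTC ACGTTAATAA CCGCTTAGAG"

seq = clean_sequence(first_line)
print("bases:", len(seq))
print("GC fraction:", round(gc_fraction(seq), 2))
```

Passing the full gene sequence to the same two functions gives values that can be compared with the gene length and GC content listed in the record.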
 
Protein sequence
MSQPEIYTPE TLSHVNNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE 
DIARLCTFAH KYHAQVFVAI NTILMDDELE TAEKLIWDVY NAGADALIVQ DMGVLQLNLP
PIALHASTQM DNRNPEKVAF LEQVGFSQVV LARELGLSQI REVAAHTNMQ IEFFIHGALC
VAYSGLCNLS HAFSNRSANR GECSQMCRLP GNLKTRQGDV LAQNEHLLSL KDNNQTDNLE
ALIDAGVRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFVASSHGR TEHTFTPDPE
KTFNRGSTDY FVNERSQGIK DFRSPKYIGQ DVGKVVAIGK DFIQVSSTHE FNNGDGLAYF
PPNYAMAKQS DDKLQGLRVN RAEGHKLHVL QVPRDLRIGM TLYRNHNQAF ETLLSKESAK
RIIVVDMCLT DTATGVALTL TDIYGLSATV ELAVEKTPAT DAEKTLQTIR TQLSKLGSTD
FTARQISIET VEPWFLPASV LNGLRRDAVA ALELARVEGY QRPKPWKYNQ DAVYPFKHLS
YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMIT KHCLRFNFNL CPKEVPGIKA
DPMVLEIGND VLKLVFDCPK CEMMVVGENR QVRGQKAV