Gene Shewmr4_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1064 
Symbol 
ID4250769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1245464 
End bp1246828 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content51% 
IMG OID638117637 
Productpeptidase U32 
Protein accessionYP_733201 
Protein GI113969408 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC CTGAGTTGTT ATCTCCCGCT GGGACGCTGA AAAACATGCG TTACGCTTTT 
GCCTATGGTG CAGATGCCGT GTATGCCGGC CAGCCGAGAT ACAGCCTGAG GGTTCGTAAT
AACGACTTTA AAATGGAAAA CCTCGCGACG GGTATCGAAG AAGCCCATGC GTTGGGTAAA
AAGCTTTATG TGGTGAGTAA CATTGCTCCC CACAACGCCA AGCTCAAAAC CTATATCAAA
GATATGGAAC CGGTAGTGGC GATGAAGCCC GATGCGCTGA TCATGTCAGA TCCTGGCCTT
ATCATGATGG TACGTGAGGC CTTCCCTGAG CAGGTGGTGC ATTTATCGGT GCAAGCCAAC
GCCATTAACT GGGCATCGGT CAAATTCTGG CAGACCCAAG GCATTAAACG GGTAATTTTA
TCCCGCGAAT TATCCTTAGA TGAAATCGAA GAAATCCGTC AACGCTGCCC CGATATCGAA
CTAGAAGTGT TTGTCCACGG CGCCCTGTGT ATGGCTTACT CTGGCCGTTG TTTACTGTCG
GGTTATATCA ATAAGCGCGA TCCAAACCAA GGCACTTGCA CTAACGCCTG CCGCTGGAAA
TACGATGTGC ACGAAGCGCA GCAAACTGAC TCTGGCGATA TCATTGCCAC CCCCAATGCG
GTGCAAATCG AGACGCCAAC CTTGGGCACG GGTCCTGCGA CCGACCAAAT CTTCCTGCTG
CAAGAAGCCA ATCGCCCCGG CGAATATATG CCAGCGTTTG AAGATGAGCA TGGCACTTAT
ATCATGAACT CTAAGGACTT GCGCGCAATC CAACACGTTG AGCGTTTGGC GAAAATGGGC
ATCGACTCGC TGAAGATCGA AGGCCGTACT AAGTCGTTCT ACTATGTTGC CCGTACCGCC
CAGCTATACC GTCAGGCCAT CGACGATGCC GCCTCAGGCA AGAGCTTCGA TCGCAGCCTG
ATGAACCAAC TCGAAGGCTT AGCACACCGC GGCTATACCG AAGGTTTCTT ACGTCGCCAC
GTACATGATG AATATCAAAA CTACGACTAT GGCTACTCGG TCAGCGATAC CCAACAATTT
GTGGGCGAAT TAACCGGTAA ACGCAATCTG GCTGGCCTTG CCGAAATCGA AGTGAAGAAC
AAATTCTCTG TCGGTGATAG TGTTGAATTA ATGACGCCAC AGGGCAATAT CAGCCTCACC
ATCGAGCAAC TCGAGAACCG CAAAGCCGAA TCGGTTGAAG CGGGATTAGG TTCGGGCCAT
ACCGTTTACT TGCCCGTGCC GAAAGAGGTT GATCTGAACC ACGGCATTTT ACTGCGTAAC
CTGCCCCAAG GTCAGGATAC CCGTAACCCA CACGAAGCAG GCTAA
 
Protein sequence
MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NDFKMENLAT GIEEAHALGK 
KLYVVSNIAP HNAKLKTYIK DMEPVVAMKP DALIMSDPGL IMMVREAFPE QVVHLSVQAN
AINWASVKFW QTQGIKRVIL SRELSLDEIE EIRQRCPDIE LEVFVHGALC MAYSGRCLLS
GYINKRDPNQ GTCTNACRWK YDVHEAQQTD SGDIIATPNA VQIETPTLGT GPATDQIFLL
QEANRPGEYM PAFEDEHGTY IMNSKDLRAI QHVERLAKMG IDSLKIEGRT KSFYYVARTA
QLYRQAIDDA ASGKSFDRSL MNQLEGLAHR GYTEGFLRRH VHDEYQNYDY GYSVSDTQQF
VGELTGKRNL AGLAEIEVKN KFSVGDSVEL MTPQGNISLT IEQLENRKAE SVEAGLGSGH
TVYLPVPKEV DLNHGILLRN LPQGQDTRNP HEAG