Gene Shewmr4_3423 details


Gene Information

Locus tag: Shewmr4_3423
Symbol:
ID: 4253989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 4093427
End bp: 4094467
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 54%
IMG OID: 638120061
Product: peptidase M28
Protein accession: YP_735546
Protein GI: 113971753
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
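
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4093427
end_bp = 4094467
reported_gene_length = 1041      # bp
reported_protein_length = 346    # aa

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported 1041 bp and 346 aa.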


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAATT TGGCACAGGC ATTCACAGCA ATCACTCGCC CCCGCGCTCA AGGCATTGGC 
TTGAGCTTAT TACGCTTGTG TATCCTCAGC TTGTGTTTGG GCCTTACGGC CTGCGCCAAT
CAACCCGTCG AATACACTTG TTCACCTGAG GCGATTCGAT TGAATTGGGC CGAACCCTCA
GTTCTTAAGC AAACGGTAGC AATACTCAGC GCCGCAGAAT TAATGGGGCG TAAAACCCAA
ACGCAAGGCG CCGCCAAGAC TCGCGACTAT TTGAACAGCC AGTTTCAGCA ACTCGGACTC
AAAGCGTGGG GAGAGACCTT CGAGGTGCCC TTCGAATATG CCACGCTTTT TAGCCAAGAG
ACGGGAAGCA ATATGGTAGC GTTAGTCCCC GCACGCCAAC CCACTCATCG ATGGCGCATT
GTGGTGGCTC ACTACGATCA TCTCGGCATG AGTGGCAGCA AGATTTACCA CGGCGCCGAT
GATAACGCCT CAGGTGTTGC CGCCCTGTTG GCTCTGGCCG CCCACTGGCA AGCGCAGCTA
AGCGCCGCCC CCGATTCGTT GCCCAACATT AACTTAATGT TTGTCGCGAC CGATGCCGAA
GAGCCGGGGC TGTTTGGCAG TACGGCCCTC GTCGAGCAAC TCAAGCAGCG CATGCCCGAG
GCGCAATTTG AACTGATGCT CAATCTCGAT ATGATTGGCC ATCCGACCCG ACCCTACGCT
ATTTACCTCG AAGGCAGCCG CAACTTTTAT CAGTTTCCAC AATTTAGGAC CATGCTAAAC
GCGAATAATC ACCTCTGTAT TAAGTTGAGC CATCCCAAAC CCGTGGGACG AAGCATCCAG
AGTACCGACT GGCTGAGAGC CTCGGATCAT TATCCTTTCC ATAAAGCCAA GATCCCTTGG
CTCTATTTTG GCGTCCCCAC TCATCCGCAA TACCATACCC CCGAGGATAC CCCCGACACC
TTGGACTATG TTTTCCTCGC GGCGGTGACA GAATCCGCCT TCGAAATCCT ACGACTCAAT
GGCGACTTTT TGAAAAATTA A
 
Protein sequence
MGNLAQAFTA ITRPRAQGIG LSLLRLCILS LCLGLTACAN QPVEYTCSPE AIRLNWAEPS 
VLKQTVAILS AAELMGRKTQ TQGAAKTRDY LNSQFQQLGL KAWGETFEVP FEYATLFSQE
TGSNMVALVP ARQPTHRWRI VVAHYDHLGM SGSKIYHGAD DNASGVAALL ALAAHWQAQL
SAAPDSLPNI NLMFVATDAE EPGLFGSTAL VEQLKQRMPE AQFELMLNLD MIGHPTRPYA
IYLEGSRNFY QFPQFRTMLN ANNHLCIKLS HPKPVGRSIQ STDWLRASDH YPFHKAKIPW
LYFGVPTHPQ YHTPEDTPDT LDYVFLAAVT ESAFEILRLN GDFLKN
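
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal, assumption-laden example: it builds the codon-to-amino-acid mapping used by translation table 11 (whose amino acid assignments are the same as the standard code) and translates only the first 30 and last 21 bases of the gene sequence listed above. The helper name `translate` and the constants `HEAD` and `TAIL` are hypothetical, and start-codon handling is ignored.

```python
# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1; stop codons are written as '*'."""
    seq = "".join(dna.split()).upper()   # drop the 10-base block spacing
    usable = len(seq) - len(seq) % 3     # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final line of the gene sequence above.
HEAD = "ATGGGCAATT TGGCACAGGC ATTCACAGCA"
TAIL = "GGCGACTTTT TGAAAAATTA A"

print(translate(HEAD))
print(translate(TAIL))
```

The translated head corresponds to the first ten residues of the protein sequence, and the tail corresponds to its last six residues followed by the stop codon.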