Gene Shewmr4_2807 details

Gene Information

Locus tag: Shewmr4_2807
Symbol:
ID: 4253378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3358978
End bp: 3360795
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 52%
IMG OID: 638119442
Product: peptidase M24
Protein accession: YP_734935
Protein GI: 113971142
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
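
The length fields in this record are consistent with the coordinates if the start and end positions are read as 1-based and inclusive, and if the CDS ends in a single stop codon. Both readings are assumptions, since the record does not state them. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3358978, 3360795)
    print(length)                  # 1818, matching "Gene Length"
    print(protein_length(length))  # 605, matching "Protein Length"
```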


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.155072
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCGTTAT CATCATCGTC ACAATCCCTA CAGCCTAATA AGATTGCTAA CCGTCTTGCC 
GCCATTCGTA GTGAACTTGC CAGTGCCAAT TTAGATGCCT TTATCATCCC GCGCGCGGAC
GAATACTTAG GCGAATATGT GCCAGAGCAT AATGAGCGTT TGTACTGGGC AACGGATTTT
ACCGGCTCAG CGGGTATGGC TATTGTCTTA AAAGACAAAG CCGCCATCTT TACCGATGGT
CGTTATACGG TGCAAGTACG TCTGCAGGTG GATGCGAATC TCTTTAGTTA TGAAAGCCTG
ACCGACACAC CGCAAATCGA ATGGCTATGC GATACGCTTG CCGCAGGTTC ACGTGTGGGC
TTCGATGCCC GTTTACACAC TTTAGCTTGG TTTGAAAACG CCAAAGCGAT GTTAGCGAAA
GCCCAAATTG AATTGGTTGC CGTTGAACAG AATCCGATTG ATAAGCACTG GCAAAATCGA
CCTGCACCGT CGAGCGCGGC TATCACGTTA TTTAGTAATG AGAGTGCGGG TAAGACCAGC
CTGCAAAAAC GGACCGAGAT TGGCGCACTA GTCAAAAAAG CCGGCGCCGA TGTCGCGTTA
ATCGCCGCTT TAGACTCCTT CTGCTGGCTA CTCAATATCC GTGGTAATGA CGTACCGCGT
CTGCCAGTCG TGCTCGGCTG CGGGCTACTT CACGCCAATG GCGATATGCA GCTGTTTACC
GATTTAAACA AACTCCCTGA AGGCATTGAG GAACATGTTG GAGCGGGTGT GAGCTTTAAG
AGCGAAGCTT CCCTTGCCGA TACCTTGGCA AGCTTACAGG GCGTGAAACT GCTCGCCGAT
CCCAATTCTG CTAACGCTTG GGCACAAAAT ATCGCCCGCG ATGCAGGCGC CAAGTTAATT
GCCGGTATCG ACCCGGTCTC CCTGCCTAAG GCACAGAAAA ATGCCGCCGA ATTAGCGGGC
ATGCGCGCCA GCCATATCCG TGATGGTGTG GCCGTGAGTC GTTTCCTCGC TTGGCTCGAT
GCCGAGGTCG CAGCCAATCG TCTGCACGAT GAAGCCACCC TCGCCGACAA GTTGGAAAGC
TTCCGCCTCG AAGATCCACA ATATCGAGAG CCAAGTTTTG ATACTATTTC CGCCGCTGGC
GCCAATGCGG CCATGTGCCA CTACAACCAT AACAATGGCA CGCCAGCCAT GATGACGATG
AACAGCATCT ACCTTGTGGA TTCTGGCGCT CAGTATCTGG ACGGCACCAC AGATGTCACC
CGTACCATCG CCATTGGTAA CGTGACCGAT GAACAGAAGA AAATGGTCAC CTTGGTATTA
AAAGGCCATA TCGCTTTAGA TCAGGCCCGC TATCCGAAAG GCACGACAGG GCAGCAACTC
GATGCCTTCG CCCGTCAATA TCTGTGGCAA CATGGCTTTG ATTACGACCA CGGCACAGGC
CACGGCGTTG GTCATTTCCT CAGCGTGCAC GAAGGTCCGC AGCGCATCGG TAAAAACCTC
AATGCCATCG CCTTAATGCC AGGCATGGTG TTATCTAACG AGCCAGGGTA TTACCGCGCC
GACAGTTTTG GGATCCGCTT AGAAAACCTA GTGGTAGTCC AACACTGCGA AGCCTTAAAG
GGCGCCGAGC GTGAAATGTA TGAATTCGAT GCCTTAACGC TGATCCCAAT GGATGCTCGC
CTTATCGATA AGAGCCTGTT AACCCAAGGC GAAATCGACT GGTTTAATGC TTACCATCAG
AAAGTGTTTA ACACCTTATC GCCATTAATG TCGGGCAGTG AGCTTAAGTG GCTGACACAG
GCGACAAAAG CCATCTAA
 
Protein sequence
MSLSSSSQSL QPNKIANRLA AIRSELASAN LDAFIIPRAD EYLGEYVPEH NERLYWATDF 
TGSAGMAIVL KDKAAIFTDG RYTVQVRLQV DANLFSYESL TDTPQIEWLC DTLAAGSRVG
FDARLHTLAW FENAKAMLAK AQIELVAVEQ NPIDKHWQNR PAPSSAAITL FSNESAGKTS
LQKRTEIGAL VKKAGADVAL IAALDSFCWL LNIRGNDVPR LPVVLGCGLL HANGDMQLFT
DLNKLPEGIE EHVGAGVSFK SEASLADTLA SLQGVKLLAD PNSANAWAQN IARDAGAKLI
AGIDPVSLPK AQKNAAELAG MRASHIRDGV AVSRFLAWLD AEVAANRLHD EATLADKLES
FRLEDPQYRE PSFDTISAAG ANAAMCHYNH NNGTPAMMTM NSIYLVDSGA QYLDGTTDVT
RTIAIGNVTD EQKKMVTLVL KGHIALDQAR YPKGTTGQQL DAFARQYLWQ HGFDYDHGTG
HGVGHFLSVH EGPQRIGKNL NAIALMPGMV LSNEPGYYRA DSFGIRLENL VVVQHCEALK
GAEREMYEFD ALTLIPMDAR LIDKSLLTQG EIDWFNAYHQ KVFNTLSPLM SGSELKWLTQ
ATKAI
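
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any downstream use. The sketch below strips the whitespace and computes a GC fraction; the function names are hypothetical, and the demo input is only the first printed line of the gene sequence, not the full gene, so its GC fraction is not expected to equal the 52% reported for the whole CDS.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First printed line of the gene sequence only (illustrative input).
    first_line = (
        "ATGTCGTTAT CATCATCGTC ACAATCCCTA "
        "CAGCCTAATA AGATTGCTAA CCGTCTTGCC"
    )
    seq = clean_sequence(first_line)
    print(len(seq))
    print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` should give a string of 1818 bases if the record is internally consistent.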