Gene Shewmr4_2051 details


Gene Information

Locus tag: Shewmr4_2051
Symbol:
ID: 4252624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2441480
End bp: 2443342
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 51%
IMG OID: 638118667
Product: peptidyl-dipeptidase A
Protein accession: YP_734181
Protein GI: 113970388
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID:
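
The gene and protein lengths listed above follow from the Start bp and End bp values. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2441480, 2443342)
print(length, protein_length_aa(length))  # 1863 620
```

Both results agree with the Gene Length and Protein Length fields of this record.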


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0115163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.257719
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTC TATTGAATCG TCCCACCACT CTTGCGCTCA CCATAGCGCT AACCTTAGGT 
TTAACCGCCT GTAATGATGC CCAAAGCAAG GCCGAAACCA CTGCAGCGGC ACCAGCTGCA
ACTCAGGCCG CGCCCGATAA AGCTCAGGCC ATCGCCTTTA TTCAAGATGC TGAAGCCCAG
ATGGCGCAGC TCTCTATCGA AGCGAATCGC GCCGAATGGA TTTACAGTAA CTTCATCACC
GAAGATACCG CTGCACTCTC GGCCGCAGTG GGTGAAAAAG TCAGTGCCGC GTCGGTCAAA
TTCGCCACCG AAGCCGCCAA GTTCGCCAAT GTGGAGCTCG ATCCTGCCAA TGCGCGTAAA
CTGAATATTC TGCGCAGCGC ACTCGTGTTA CCTGCGCCGC TTGATCCCGC CAAAAATGCT
GAGCTGGCGC AAATCAGCTC CGAGCTCAAT GGACTCTATG GTAAAGGCAA ATACTGTTTC
GCCGATGGCA AGTGTATGAC CCAGCCAGAG CTGTCGAGCC TGATGGCGGA ATCGCGTGAC
CCCGCCAAAC TGCTTGAGGC ATGGAAAGGC TGGCGTGAAA TTGCAAAACC GATGCGCCCG
CTATTTCAAC GTGAAGTTGA ACTCGCCAAC GAAGGTGCAA AAGATCTAGG TTACGCTAAC
CTCTCTGAGT TATGGCGCAG CCAATACGAT ATGAAACCCG ATGAATTTTC ACAGGAATTA
GATCGTTTAT GGGGTCAAGT TAAGCCTCTC TACGAATCCC TGCATTGTTA TGTTCGCGGC
GAGCTGAATA ACGAATACGG TGACGCCATC GCGCCAAAAA CTGGCCCTAT TCCCGCACAC
TTACTTGGAA ATATGTGGGC GCAGCAATGG GGTAATGTGT ACGATCTCGT CGCCCCTGAC
GATGCCGATC CTGGCTACGA TGTGACAGAA TTACTCGAGC AAAAAGGCTA TGATGAACAC
AAGATGGTCA AACAGGCCGA AAGCTTTTTC ACCTCGTTAG GCTTTGCGCC GCTGCCAGAC
AGTTTTTGGA GCCGCTCTTT ATTCTTGCAA CCCAAGGATC GTGATGTCGT TTGTCATGCC
TCGGCGTGGG ATTTGGATAA CCTCGACGAT ATTCGCATCA AGATGTGTAT CCAGAAAACC
GCTGAAGACT TCACCGTTAT TCACCACGAA CTTGGGCACA ACTTCTATCA ACGTGCCTAT
AAACAGCAGC CTTTCCTGTT TAAAAACAGT GCCAATGATG GTTTCCATGA GGCCATTGGT
GACACGATCG CGCTATCAAT TACCCCAAGC TATCTAAAAC AGATTGGCTT GCTCGAAGAC
GTGCCCGATG CTTCCAAGGA TATTGGCTTA CTGCTGAAAC AAGCTCTGGA TAAAATCGCC
TTCCTACCCT TTGGTTTGAT GATTGACCAG TGGCGCTGGA AAGTCTTTAG CGGTGAAATC
ACTCCAGCAC AATATAACCA AGCCTGGTGG GAATTAAGAG AGAAATACCA AGGCGTTAAG
GCGCCGACAG ACCGCAGCGA AGCCGATTTT GACCCAGGTG CCAAGTACCA TGTGCCCGGT
AACGTGCCTT ATACGCGCTA CTTCCTCGCG CATATTCTAC AATTCCAGTT CCATAAGGCA
CTGTGTGATA CCGCTGGCGA TAAAGGGCCT GTGCACAGAT GCAGTATTTA TGGCAACCAA
GCTGCAGGGG AAAAGCTCAA TAAAATGTTA GAGCTGGGTG CAAGTCAACC TTGGCCAGTT
GCTTTAAAAG AAGTGACAGG TACTGAGGGG ATGGATGCCA AAGCCGTACT CGATTATTTT
GCGCCGCTTA AAACCTGGCT CGATGAGCAA AACACGGCGG CGAATCGTCA ATGTGGTTGG
TAA
 
Protein sequence
MAILLNRPTT LALTIALTLG LTACNDAQSK AETTAAAPAA TQAAPDKAQA IAFIQDAEAQ 
MAQLSIEANR AEWIYSNFIT EDTAALSAAV GEKVSAASVK FATEAAKFAN VELDPANARK
LNILRSALVL PAPLDPAKNA ELAQISSELN GLYGKGKYCF ADGKCMTQPE LSSLMAESRD
PAKLLEAWKG WREIAKPMRP LFQREVELAN EGAKDLGYAN LSELWRSQYD MKPDEFSQEL
DRLWGQVKPL YESLHCYVRG ELNNEYGDAI APKTGPIPAH LLGNMWAQQW GNVYDLVAPD
DADPGYDVTE LLEQKGYDEH KMVKQAESFF TSLGFAPLPD SFWSRSLFLQ PKDRDVVCHA
SAWDLDNLDD IRIKMCIQKT AEDFTVIHHE LGHNFYQRAY KQQPFLFKNS ANDGFHEAIG
DTIALSITPS YLKQIGLLED VPDASKDIGL LLKQALDKIA FLPFGLMIDQ WRWKVFSGEI
TPAQYNQAWW ELREKYQGVK APTDRSEADF DPGAKYHVPG NVPYTRYFLA HILQFQFHKA
LCDTAGDKGP VHRCSIYGNQ AAGEKLNKML ELGASQPWPV ALKEVTGTEG MDAKAVLDYF
APLKTWLDEQ NTAANRQCGW
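
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. It is a minimal illustration and not the pipeline that produced this record. The codon-to-residue assignments are those of the standard genetic code, which translation table 11 shares. The extra start codons that table 11 allows are not handled, which does not matter here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = clean_sequence("ATGGCTATTC TATTGAATCG TCCCACCACT")
print(translate(first_blocks))  # MAILLNRPTT
```

The translated prefix matches the first ten residues of the protein sequence listed in this record. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.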