Gene Shewmr4_0019 details


Gene Information

Locus tag: Shewmr4_0019
Symbol:
ID: 4250705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 23767
End bp: 25086
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 49%
IMG OID: 638116558
Product: proline dipeptidase
Protein accession: YP_732158
Protein GI: 113968365
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
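
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The function names gene_length and protein_length are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 23767
end_bp = 25086
reported_gene_length = 1320      # bp
reported_protein_length = 439    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the ones reported in the record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under those assumptions, the span from Start bp to End bp and the codon count agree with the reported Gene Length and Protein Length.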


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0806227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000125098
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCACT TGGCTCATCA CTATCATGCC CACATCGCCG AGCTTAATCG TCGAGTCGCT 
GAAATTGTAT CGCGTGAAGC ATTATCAGGT TTAGTGATCC ACTCGGGTCA GCCACATCGA
ATGTTTTTGG ACGACATTAA TTATCCTTTT AAAGCCAATC CCCACTTCAA GGCTTGGTTA
CCTGTGTTGG ATAACCCTAA CTGCTGGTTA GTAGTAAACG GTCGCGATAA GCCACAACTG
ATTTTTTATC ACCCTGTGGA CTTTTGGCAT AAGGTGTCTG ACGTGCCGGA GATGTTCTGG
ACCGAGCATT TTGAGATCAA GTTACTCACT AAGGCCGATA AGGTTGCCGA GCTATTGCCT
AGCGATATCA CCAACTGGGC TTATTTAGGT GAGCATTTAG ATGTGGCTGA GGTACTGGGT
TTCACCAGTC GCAATCCTGA CTCTGTGATG AGCTATCTGC ATTTCCACCG TACCACTAAA
ACCGAATATG AACTCGAATG TATGCGCCGT GCGAACCAGA TCGCGGTGCA AGGGCATTTA
GCGGCTAAAA ATGCCTTCTA TAACGGTGCC AGCGAGTTTG AAATTCAGCA GCAGTATTTA
TCTGCCGTGG GACAGGGCGA GAACGAAGTG CCCTACGGTA ATATCATTGC CTTAAACCAA
AATGCGGCGA TTTTGCATTA CACCGCGCTC GAGCATCAAA ATCCGGCGCG GCGCTTATCT
TTTTTAATCG ATGCGGGCGC CAGTTATTTT GGCTACGCGT CGGATATCAC GCGTACCTAT
GCGTTTGAGA AGAATCGTTT CGATGAATTG ATCACTGCCA TGAACAAGGC GCAGCTCGAA
CTGATCGATA TGATGCGCCC AGGTGTGCGT TATCCCGATT TACACTTGGC TACCCATGGC
AAAGTCGCGC AAATGCTGTT GGATTTTGAG TTAGCCACGG GCGATGCCCA AGGCTTAGTC
GACCAAGGCA TAACCAGCGC CTTCTTCCCC CACGGACTCG GCCATATGTT AGGCTTGCAA
GTACATGATG TGGGAGGTTT TGCCTTCGAT GAGCGTGGTA CCCATATTCC GGCCCCTGAG
GCCCATCCGT TCCTGCGTTG CACCCGCATT TTAGCGCCAA ACCAAGTGCT AACGATGGAG
CCAGGATTAT ACATTATCGA TACTTTACTC AATGAGCTAA AACAAGATAG CCGTGACCAG
CAGATCAATT GGCGCACCGT TGATGAGTTG CGACCTTTCG GTGGTATCCG TATCGAGGAC
AATGTGATTG TGCATCAGGA TCGAAACGAA AATATGACCC GCGAGCTGGG CTTAGCGTGA
 
Protein sequence
MDHLAHHYHA HIAELNRRVA EIVSREALSG LVIHSGQPHR MFLDDINYPF KANPHFKAWL 
PVLDNPNCWL VVNGRDKPQL IFYHPVDFWH KVSDVPEMFW TEHFEIKLLT KADKVAELLP
SDITNWAYLG EHLDVAEVLG FTSRNPDSVM SYLHFHRTTK TEYELECMRR ANQIAVQGHL
AAKNAFYNGA SEFEIQQQYL SAVGQGENEV PYGNIIALNQ NAAILHYTAL EHQNPARRLS
FLIDAGASYF GYASDITRTY AFEKNRFDEL ITAMNKAQLE LIDMMRPGVR YPDLHLATHG
KVAQMLLDFE LATGDAQGLV DQGITSAFFP HGLGHMLGLQ VHDVGGFAFD ERGTHIPAPE
AHPFLRCTRI LAPNQVLTME PGLYIIDTLL NELKQDSRDQ QINWRTVDEL RPFGGIRIED
NVIVHQDRNE NMTRELGLA