Gene Shew_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_0020 
Symbol 
ID4920935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp23495 
End bp24817 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content56% 
IMG OID640161532 
Productproline dipeptidase 
Protein accessionYP_001092152 
Protein GI127510955 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00930691 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACT TAGCCACCCT CTATCCCGCC CACATTATTG AACTGAACCG GCGCGTCGCC 
GAGATCACCG CAAGAGAGCA GTTGGCCGGC TTAGTGATTC ATTCGGGTCA GCCTCATCGT
CAGTTTCTCG ATGATCTGGA TTACCCCTTT AAAGTTAATC CGCACTTCAA GGCCTGGCTT
CCGGTTATCG ATAATCCCCA CTGCTGGCTG ATCGTCAACG GTCGGGATAA GCCGCAACTC
ATCTTCTATC GTCCGGTGGA TTTTTGGCAT AAGGTCGCCG ATCTACCAGA GGATTTTTGG
ACGACCGAGA TAGAGATTAA AGTGCTGACC AAGGCCGACA AGGTGGCAGA TCTGCTGCCG
GGCAAGTTGC AGGAGTGGGC CTATATTGGC GAGCATCTGG ATGTCGCCGA TGTGCTGGGC
TTCGGAAGCC GTAACCCCGA GGCGGTGATG AGCTACCTGC ATTATCACAG AGCCAGCAAG
ACGGCCTATG AGTTGGCCTG CATGCGCCGG GCGAGTGAGA TCGGTGTGCG TGGTCATGTG
GCGGCCAAGA GTGCCTTCTA TGCGGGCGCG AGCGAGTTTG AGATCCAGCA AGCCTACCTG
GCTGCGACTG ATATGGGCGA GAACGATGTG CCCTACGGCA ACATTATCGC ACTGAATCAA
AACGCCGCGA TTCTGCACTA CACGGCGCTG GAGCATGTGT CGCCCAAGCA GCGACTCTCC
TTCCTTATCG ATGCCGGTGG TAGCTTCCAT GGCTATGCCT CGGACATCAC CCGTACCTAT
GCCTTCGAGA AGAACCTGTT CGGCGACCTG ATCGCCGCCA TGGACAAGTT ACAGCTGGCC
ATCATCGAGA TGATGCGTCC GGGCGTGAAG TATGTAGATC TGCATCTGGC GACACACCAG
AAGCTGGCAC AGCTGCTGCT GGACTTCAAG TTAGTGCAAG GCGATCCCCA AGGACTGATA
GAGCAGGGGA TCACCAGCGC CTTCTTCCCC CATGGTCTGG GGCATATGTT GGGCCTACAG
GTACATGATA TGGGCGGCTT CCTCCACGAC GAGCGCGGCA CCCACATTGC ACCGCCGGAG
GCGCATCCCT TCCTGCGCTG TACCCGCACC CTGGCCGCTA ACCAGGTGCT GACCATAGAG
CCAGGGCTTT ACATCATCGA CAGCCTGCTT AACGAGTTGA AACAGGATGG TCGCGCCGAT
TGGATTAACT GGCAGATGGT GGATCAGGTG CGCCCCTTCG GTGGCATTCG TATCGAAGAC
AATGTGATCG TCCATAGCGA TCATAACGAA AATATGACTC GCGATCTGGG TCTGCACGGT
TAA
 
Protein sequence
MENLATLYPA HIIELNRRVA EITAREQLAG LVIHSGQPHR QFLDDLDYPF KVNPHFKAWL 
PVIDNPHCWL IVNGRDKPQL IFYRPVDFWH KVADLPEDFW TTEIEIKVLT KADKVADLLP
GKLQEWAYIG EHLDVADVLG FGSRNPEAVM SYLHYHRASK TAYELACMRR ASEIGVRGHV
AAKSAFYAGA SEFEIQQAYL AATDMGENDV PYGNIIALNQ NAAILHYTAL EHVSPKQRLS
FLIDAGGSFH GYASDITRTY AFEKNLFGDL IAAMDKLQLA IIEMMRPGVK YVDLHLATHQ
KLAQLLLDFK LVQGDPQGLI EQGITSAFFP HGLGHMLGLQ VHDMGGFLHD ERGTHIAPPE
AHPFLRCTRT LAANQVLTIE PGLYIIDSLL NELKQDGRAD WINWQMVDQV RPFGGIRIED
NVIVHSDHNE NMTRDLGLHG