Gene Sama_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0033 
Symbol 
ID4602290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp36808 
End bp38127 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content53% 
IMG OID639779342 
Productproline dipeptidase 
Protein accessionYP_925915 
Protein GI119773175 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.12189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0842725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATT TGGCCCCGCT GTACGTCAAT CACATCAATG AACAAAACCG CCGGGTGGCC 
GATGTGTTGG CCCGGGAACA ACTTGAAGGC CTGGCCATTC ATTCAGGCCA ATATCACCGT
CAGTTTTTGG ATGACATCAA CTATCCCTTT AAGGCAAATC CGCATTTCAA GGCCTGGCTG
CCGGTGCTGG ATATTCCCAA CTGCTGGATA CTAACCAATG GGCGTGATAA GCCTGTGTTG
GTGTTCTATC GCCCTGTGGA TTTCTGGCAT AAGGTCAGCG ATGTGCCCGA GAGTTTCTGG
ACCGAGCATT TTGAAATTAA GCTGCTGACT AAGGCAGAGA AGGTGGCGGA TTTGCTGCCA
AAGAACCTCG ATACCTGGGC TTATATTGGT GAACATCTGG ATGTGGCCGA CGTGCTGGGC
TTCAAGAATC GCAATCCCGA TGGAGTGATG AACTACTTCC ATTGGCACAG AAGCTTCAAG
ACAGACTATG AGCTTGCCTG TATGCGTGAG GCAAACCGAG TCGCGGTGGC AGGCCATAAT
GCTGCGAGAG AAGCCTTCTA CAAGGGCGCC AGTGAGTTTG AAATACAGCA GCAATATTTG
TCGGCCATAG GTCAGGGCGA GAATGACGTG CCTTACGGCA ACATCATTGC TCTGAACCAA
AATGCGGCCA TCCTGCACTA CACCGCGCTG GAGCACGCGG CGCCTGCCAA CCGGCATTCG
TTTTTAATCG ATGCAGGCGC TTCTTTTAAT GGCTACGCCG CAGACATAAC CCGCACCTAT
GCATTTGAAA AAAATGTGTT TGATGAGCTT ATCAAGGCAA TGGACAAGAT GCAGCGGGAG
CTGGTGGACA TGATGCGTCC CGGTGTGCGC TTTACCGATC TGCATCTGGC TACACATCAC
AAATTGGCTC AGCTGCTGCT GGAGTTTGGT ATCGCCAGGG GCGAGGCCAG CGACTTGGTG
GAGCAGGGGG TGACCAGTGT GTTCTTCCCT CATGGGCTTG GGCATATGTT GGGCTTACAG
GTGCACGATG TGGCTGGGTT TGCCCATGAT GAGCGTGGCA CCCATTTGGC TGCGCCAGAG
CGTCATCCCT TCCTGCGCTG TACCCGAGTT CTTGCGCCAC GCCATGTGCT GACCATAGAG
CCTGGTTTTT ACATCATCGA CAGTCTGTTG ACCGAACTGA AGGCCGATGG CCGGGCTGAA
GCAGTGAACT GGGACATGGT AAATACACTG CGTCCCTTTG GCGGCATACG TATTGAGGAT
AACGTGATTG TGCATCAGGA GCGCAACGAG AATATGACCC GGGATCTGGG GCTTAACTGA
 
Protein sequence
MENLAPLYVN HINEQNRRVA DVLAREQLEG LAIHSGQYHR QFLDDINYPF KANPHFKAWL 
PVLDIPNCWI LTNGRDKPVL VFYRPVDFWH KVSDVPESFW TEHFEIKLLT KAEKVADLLP
KNLDTWAYIG EHLDVADVLG FKNRNPDGVM NYFHWHRSFK TDYELACMRE ANRVAVAGHN
AAREAFYKGA SEFEIQQQYL SAIGQGENDV PYGNIIALNQ NAAILHYTAL EHAAPANRHS
FLIDAGASFN GYAADITRTY AFEKNVFDEL IKAMDKMQRE LVDMMRPGVR FTDLHLATHH
KLAQLLLEFG IARGEASDLV EQGVTSVFFP HGLGHMLGLQ VHDVAGFAHD ERGTHLAAPE
RHPFLRCTRV LAPRHVLTIE PGFYIIDSLL TELKADGRAE AVNWDMVNTL RPFGGIRIED
NVIVHQERNE NMTRDLGLN