Gene Sama_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0078 
Symbol 
ID4602335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp84659 
End bp86656 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content50% 
IMG OID639779390 
Productprolyl oligopeptidase family protein 
Protein accessionYP_925960 
Protein GI119773220 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGA TGATGACCAT AGTGCTATCG CTTATATGGA TGTGGGTGAG CCCCACCGCC 
CACGCCTATA CGCAATTAAC AAAAGACGAC TTTATCAGTG ACCCTCTCAT CTACGACGCC
GAATTTTCAC CGGATGGTCG TTATCTGGCG TTTATTCGCC AGGCAGGTAA GAGCCGCGAT
GTTGTCATCC GGGACTTTTC TCAGGAAGGG GCGCCTATCA CCGGGATATT GCAGGATGAA
TTTATTCGCG CCGACTCTAT TAGTTGGGCA AATAACACTC GGGTGATTGT GAATCTGATG
GTGCCCTACG AGCGTATCTC CAAATTAAAA AAGAAGGCGG AGAAAGACCC GGAGTTCGAC
CTCGATGAGT ACGATTACTT CAGGCGTTCC ATTTCCATGG ATGTGCACTG TCAGGATAGG
GTGGTGCTGC TCAACCATAA AAAATACAGC CGCAAAAACT TAAACCTGTC CAGAGTCAGT
AACCTGCTGG TGGATGACGA GCAGCATATT CTTATGCCCG CCTGGGGCCA TAAAGGCCTC
GAAATCATGA AAGTGAATGT GTACACGGGG AAGGGTGAAG TGGTGCTGGA GGGCGGAAGG
CGCACCTACA ACATTTTAAC CGACAAACAG GGCCAGCCTA CGTTCAGGCT GGACTATTAC
TATTACAGCC GCAGTGTGCA AGTGTACGAA TACACTCAGG AAGGCGAATG GGTTCCCATC
GACCGTATCT ATTTCGAGCA AAATGAAGAT GGCGAATTCG ACTTTGAAGG CTTGGTAGGT
ATTGGTAAGG AAGGCGAGCT GATATACCGC AAGCGCAATG AAACGAGCGG CTATTATGAA
ATCGTCAAAT ATAAAAAAGG CAGCAAAGAG AAGCAGGTTG TCGCCTCGCT GCCTGAAGAA
GATATTTACT CTCCCATGTT TGACGCCTTC ACCGGCGAAT ATCTGGGTTA CCAGGTGCAG
CGGGACCTCA TCCGTAATGT GTATTTGGAT AAGAGTTATC AGGCCCACTA CGACAAGGTG
GCTGAAGACA TAGGGCACAG CAACTTCTCT TTCTGGGCTT CGAGCACCAG TAAAAACCGT
GTGGTGGTAA AAAGCAGCGG TGCAGACCAT CTGGGCAAAT TCTATGTTTA CGACTACAAG
ACCCAGGCCC TGACATGGCT GGGGGATGTA CATAATCAAC TGGTGCCCGA GAATCTCGGG
TTGCCCGCCA AGGTGAATTA CAGTACCAGA GACGGGCAAA AGCTGCGTAT GTACCTGCTA
TTCCCACCCA ATTATGACGA CACCAAAGCC TACCCTATGG TGGTGTTACC CCACGGCGGC
CCACAATCCC GTGACAGTGC CAGCTTTGAT TTCTTCGCCC AGTTCATTGC TACCCGGGGT
TATATCGTCA TACAGCCAAA CTTCCGCGGT TCTACCGGTT ATGGACTGGA ATTTGAGAAA
GCTGGCTACA AACAGTGGGG ACAGCGGATG CAGGACGATG TGTCAGACGC CGTCACCTAC
ATGACCCAAA ATGGCTACGC GGATAAGTCC AGGGTGTGCA TTGTGGGGGC CTCCTATGGT
GGTTATGCCG CGCTGATGGG CGCCATTAAA ACCCCTGAGC TGTACCGTTG CAGCATCAGT
ATTAACGGGG TGACCCACCT TAAAGATCAA ATCGCGTTTG ACGTGGATTC CGCAGAAATA
AACGAAGACA GAATTGAAGA GATACTCTAC GAACGCATTG GTCATCCCAT CCGGGACGCC
AAAATGTTGG ATGACAATTC ACCGGCGTTA CTTGCATCAA AAGTGAGCTT ACCACTGCTG
ATTATCGCCG GAGACAGCGA TCAAATCGTG CCCTACACCC AGGCTGAAGT CATGGTTGAG
GCACTGGCGA AGTCGAAGAA AGACTTTAAG TTTGTCGAAC TGACAGACAC AGGCCATAAC
CCCTTTATCC TGAAAGACAG CGCCGCCAAG GTGTATCAGG AAGTTGAGCA GTTCCTGAAA
ACTCACCTTG GGGAATAG
 
Protein sequence
MKVMMTIVLS LIWMWVSPTA HAYTQLTKDD FISDPLIYDA EFSPDGRYLA FIRQAGKSRD 
VVIRDFSQEG APITGILQDE FIRADSISWA NNTRVIVNLM VPYERISKLK KKAEKDPEFD
LDEYDYFRRS ISMDVHCQDR VVLLNHKKYS RKNLNLSRVS NLLVDDEQHI LMPAWGHKGL
EIMKVNVYTG KGEVVLEGGR RTYNILTDKQ GQPTFRLDYY YYSRSVQVYE YTQEGEWVPI
DRIYFEQNED GEFDFEGLVG IGKEGELIYR KRNETSGYYE IVKYKKGSKE KQVVASLPEE
DIYSPMFDAF TGEYLGYQVQ RDLIRNVYLD KSYQAHYDKV AEDIGHSNFS FWASSTSKNR
VVVKSSGADH LGKFYVYDYK TQALTWLGDV HNQLVPENLG LPAKVNYSTR DGQKLRMYLL
FPPNYDDTKA YPMVVLPHGG PQSRDSASFD FFAQFIATRG YIVIQPNFRG STGYGLEFEK
AGYKQWGQRM QDDVSDAVTY MTQNGYADKS RVCIVGASYG GYAALMGAIK TPELYRCSIS
INGVTHLKDQ IAFDVDSAEI NEDRIEEILY ERIGHPIRDA KMLDDNSPAL LASKVSLPLL
IIAGDSDQIV PYTQAEVMVE ALAKSKKDFK FVELTDTGHN PFILKDSAAK VYQEVEQFLK
THLGE