Gene Spea_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpea_2039 
Symbol 
ID5662432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella pealeana ATCC 700345 
KingdomBacteria 
Replicon accessionNC_009901 
Strand
Start bp2467255 
End bp2469306 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content46% 
IMG OID641236634 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001501894 
Protein GI157961860 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.033765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CTGCAATCAT AATAGCGCTT GCATCTGCAG GGGCCGTATT TAGCGCTCAG 
GCCACGTCAA CATCACCTTT CACTGTGCAA GACCTCGTTA AAATGAATAA GCTTCATTCG
GCAGCATTGT CAAATGATGG CACAAAAATT GTTTATGCGG TGAAAAATAT TGATGCAGAA
GGTGAGGCGA GTACTGATCT TTACATTCAA GACCTGACTT CAAGCTCTTC AAAAGCAAAA
CAGATCACCA GTGCCGCTGG TACTGAGCAT AGCGTCGCTT TTGCTCCTAA TGACAAGAGT
ATCTATTTCC TTGCTGCGCG TAATGGCTCA AGCCAAGTGT ACGAGCTTGC ACTCGACGGC
GGTGAAGCGA TTCAGGTAAC CGATTTTCCG TTAAATGTAG AAGGCTATAA GCTATCGCAA
GATGGCAAGC AAGTCGTAGT CAACATGCGT GTCTTCCCAG ACTGTAAAGA TCTAGCTTGC
TCAAAAGATA AATTCACAGC TGAAGCCGAG CGTGATACTA CGGGTCGCGA ATATAAGCAG
CTAATGGTGC GTCACTGGGA CACTTGGAGT GACCACTCAA GAAGCCATCT GTTTGTTGCC
CAATTAAACG GCGAAAAAGT GACTTCAGCT ATTGATGTTA CTGCCGGATT AGATACCGAA
ACACCGCCAA AGCCATTCTC TGGCATGGAA GAAGTGACCT TCACGCCAGA TGGTAAGCAT
GTAGTTTATA CGGCTAAAGC ACCGGGTAAA GATCAGGCTT GGATCACTAA TTACGATTTA
TGGAAAGTGA GTGTGGCTGG TGGCGAGGCT GAAAACTTAA CCGAGTCGAA TAAGGCTTGG
GATGCACACC CAACGTATTC AGCCGATGGC CGTTATCTGG CCTATTTCGC CATGACAAAA
CCTGGCTTTG AAGCCGATCG CTACCGCATT ATTTTGCGTG ATACCGTAAC GGGCCAAGAG
AAAGAAGTGG CGCCGCTATG GGATCGCAGC CCAAGCTCTC TAGCTTTTGG TAAAGACAAC
AAAACATTAT ATGTGACTGC ACAAGATGTT GGTCAGGTAT CAATCTTTGA AGTGAATACC
CAGTTTGGTG ATGTTAAGAC CATCTATAAC GAAGGTAGCA ACAGCATCGT TGGTGTGAAT
AATGACAAGA TTATCTTTAG TCATAAATCG CTAGTTGAGC CAGGCGATCT TTATACCATC
AACCTAGACG GTCAAAACCT TAATCGTGTG ACTGAAGTTA ACAAAGACAA GTTAGCTAAA
GTTAACTTTG GTGAATACCA GCAATTTAGC TTTAAGGGCT GGAATAACGA AGAAGTATAT
GGCTATTGGA TTAAGCCTTC AAACTACAAG GCTGGTGAGA AGTACCCCAT TGCTTTCTTA
GTGCACGGTG GCCCACAAGG ATCATTTGGT AACTCATTTA GCAGCCGCTG GAACCCTCAG
TTATGGGCTG GAGCGGGTTA TGGTGTGGTG ATGATCGATT TCCATGGCTC AACTGGTTAT
GGCCAAGCGT TTACTGATTC AATCACTCAA GATTGGGGCG GCAAGCCACT TGAAGATCTG
CAAAAAGGTT TAGCGGCGGT GACTAAGCAG CAGAAGTGGT TAGACGGTGA TAATGCTTGT
GCACTTGGAG GTTCATACGG TGGATACATG ATGAACTGGA TCCAGGGTAA CTGGAATGAC
GGTTTCAAGT GCCTAGTGAA TCACGCTGGT CTATTTGATA TGCGCTCTAT GTACTATGTA
ACTGAAGAGC TTTGGTTCCC AGAGTTTGAG TTTGGCGGTA CCTACGAGAA GAACAAAGAG
CTATACGAGA AGTTTAACCC AGTGAATTAT GTAGAGAACT GGAAGACTCC TATGCTAGTT
ATCCATGGCG AGAAAGACTT CCGTGTTCCA TACGGTCAAG GCCTCGCGGC ATTCACCTAC
ATGCAGCGCA ATGGGATCCC ATCAGAGTTA CTCATCTATC CAGATGAAAA CCACTGGATC
TTAACCCCTG AAAACCTAGA GCAGTGGTAC GCCAACGTAC TAGGCTGGAT GGATCGCTGG
ACAGAGAAGT AA
 
Protein sequence
MKKTAIIIAL ASAGAVFSAQ ATSTSPFTVQ DLVKMNKLHS AALSNDGTKI VYAVKNIDAE 
GEASTDLYIQ DLTSSSSKAK QITSAAGTEH SVAFAPNDKS IYFLAARNGS SQVYELALDG
GEAIQVTDFP LNVEGYKLSQ DGKQVVVNMR VFPDCKDLAC SKDKFTAEAE RDTTGREYKQ
LMVRHWDTWS DHSRSHLFVA QLNGEKVTSA IDVTAGLDTE TPPKPFSGME EVTFTPDGKH
VVYTAKAPGK DQAWITNYDL WKVSVAGGEA ENLTESNKAW DAHPTYSADG RYLAYFAMTK
PGFEADRYRI ILRDTVTGQE KEVAPLWDRS PSSLAFGKDN KTLYVTAQDV GQVSIFEVNT
QFGDVKTIYN EGSNSIVGVN NDKIIFSHKS LVEPGDLYTI NLDGQNLNRV TEVNKDKLAK
VNFGEYQQFS FKGWNNEEVY GYWIKPSNYK AGEKYPIAFL VHGGPQGSFG NSFSSRWNPQ
LWAGAGYGVV MIDFHGSTGY GQAFTDSITQ DWGGKPLEDL QKGLAAVTKQ QKWLDGDNAC
ALGGSYGGYM MNWIQGNWND GFKCLVNHAG LFDMRSMYYV TEELWFPEFE FGGTYEKNKE
LYEKFNPVNY VENWKTPMLV IHGEKDFRVP YGQGLAAFTY MQRNGIPSEL LIYPDENHWI
LTPENLEQWY ANVLGWMDRW TEK