Gene Sama_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2044 
Symbol 
ID4604294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2480486 
End bp2481460 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content54% 
IMG OID639781421 
Producthypothetical protein 
Protein accessionYP_927919 
Protein GI119775179 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.47923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTGA CTTTGCTCGG CGCCGCTACG GCGTTATGGG GTATGAAAGA GCTTGAAACC 
TTTGCCAACA GGCCTCTGAT GCTTGAGCAG GCGAGGGAGC TGGAACTCAA TCGGGGAACC
AATGCCCGCG CATTGGGCAA AGAGTTGGTT GAGCAGGGAC TCCTTGAAGG CTCATGGCAT
TACGATTGGT ATCTGCGATT AAATCCCGCC ATGGCGGGCA TTCGTCAGGG GTTGTACGAA
ATCACCCCAG GCGACACGGT CAAGTCACTG CTTGAAAAGC TCATCAGCGG TAAAGTGAAG
GACTTCGCAA TCACTTTGGT TGAAGGCCAA ACGCTGCGGG AATGGCAGGC TAAGCTGGAA
ACTGCCCCCA GGCTGAATTG GGACGCGGAT GTTTTCCATA AGGTTCTCAA GGCGAACGGT
GATGATTCCG GATTGCCGGA AGGGAAATTT TTTCCCGATA CCTACAGCTA TCCTGCCAAC
CAGGATGTGG AAACGCTGCT TACCCAGAGC TACCTGAAGA TGCAGCAAGA GCTGGCAGCG
GCCTGGCAGG TCAGAGCTCC CGATTTACCT CTGGCGAGCG CCTATGAGCT GCTTATCCTG
GCGTCCATTA TAGAGAAAGA GACCGGCAAG GCCGAGGAGC GCCCCTTGAT TGCTGCGGTG
TTTATCAACC GGCTGCGAAA GGGAATGCGA CTGCAGACAG ACCCCACGGT GATTTATGGC
ATGGGAACGC GCTTTAACGG CAATATCACC CGTAAAGATC TGCGTGAGGA TACGCCCTTC
AACACCTATC GCATTCAAGG ACTGCCACCT ACGCCTATTG CGGCACCCGG ACGTGAAGCC
TTGATGGCGG CGGCACAACC GGCACAATCA GATTATCTTT ACTTTGTGTC CAGAAACGAT
GGCAGCCACG TATTTTCCCG CACACTCGCT GAACACAATC GCGCAGTAAA CCAATTTCAG
AGAAAACAAA AATGA
 
Protein sequence
MSLTLLGAAT ALWGMKELET FANRPLMLEQ ARELELNRGT NARALGKELV EQGLLEGSWH 
YDWYLRLNPA MAGIRQGLYE ITPGDTVKSL LEKLISGKVK DFAITLVEGQ TLREWQAKLE
TAPRLNWDAD VFHKVLKANG DDSGLPEGKF FPDTYSYPAN QDVETLLTQS YLKMQQELAA
AWQVRAPDLP LASAYELLIL ASIIEKETGK AEERPLIAAV FINRLRKGMR LQTDPTVIYG
MGTRFNGNIT RKDLREDTPF NTYRIQGLPP TPIAAPGREA LMAAAQPAQS DYLYFVSRND
GSHVFSRTLA EHNRAVNQFQ RKQK