Gene Ssed_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_1940 
Symbol 
ID5610333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp2339033 
End bp2340037 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content40% 
IMG OID640932826 
Productsugar-binding protein, putative 
Protein accessionYP_001473679 
Protein GI157375079 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGATTAT CGATAATACT TTTAGTATTT ACCATGTTGC CACTATTTTC AGCCAGTGGG 
AAAGATCTTA AGTTCGCAGT TGTTCCTAAG TACCACAGTG TTTTTTTTGA ACAGAGCAAA
CATGGTTGTA AGGATGCAGC CACTCAAATA AAAGGCGTCG AGTGTATATA TCGGGGCCCT
GAAAAAGCGA GTGTTAGAGT ACAGGATCAG ATTATTTCAC AACTGATCGA TGAGGGGGTT
GATGGCATCG CTGTAGCCGT TACACAGTCT AAATTCCTCG CAGAAAATAG TATTCAAAAA
GCACGAAATG CTGGAATACC TATTGTCACT TATGACTCTG ATTTTGACCT TCAAACCTTG
GAAAAGTATA AAAAGATACG CTCAACTTAT ATAGGGACAG ATAATTTTCA GTTTGGTAGA
GCTTTAGGGG AACAACTAAA AAAACAGCGC CCCAATGGAG GAACATTAAT TATTCAAACT
GGACGCCCAG ACTCTCCAAA TTTGAATCTT AGAATTATGG GGATCCGTTC TGCTCTGTCT
GGCAAACAAT ATAATACTCC TCCCGGGAAA ATGCTCCTAA ATGATAGTGG CTGGACTGAA
GTAAGAGAGC CTTTTATTAA TTTTGATCAG CTTTCAAGGG CGGTAAAGCA GATGGAGTCA
GTGGTACAGG GAAGGCGATT AAAAGCGGAC TCCTTTATTG CCGTTGGTGG TTGGCCTCAA
AATGATGAAG CCCTTTATCG AAAAATGATC GCCCCTTTTA AAGAGAAGCT TGAGCGTAAA
GAGGTGATAG TTGTTATCTC TGATGCATCA GATCAGCAGT TAATCATGTT ACGAGACCAG
CTTGCTCATG CCAATGTTGG CCAAAACCCT TATGAGATGG GAAGGCAAGC CATTTTAACC
CTGCATAATA TTGTAAAAAA TCTAGATTAC GATGAGTTTA TTCATACCCC TATTAATTTG
TGTACCCGGG AAAACTACAC TAGCTGCACC CAACACAATT TATAA
 
Protein sequence
MRLSIILLVF TMLPLFSASG KDLKFAVVPK YHSVFFEQSK HGCKDAATQI KGVECIYRGP 
EKASVRVQDQ IISQLIDEGV DGIAVAVTQS KFLAENSIQK ARNAGIPIVT YDSDFDLQTL
EKYKKIRSTY IGTDNFQFGR ALGEQLKKQR PNGGTLIIQT GRPDSPNLNL RIMGIRSALS
GKQYNTPPGK MLLNDSGWTE VREPFINFDQ LSRAVKQMES VVQGRRLKAD SFIAVGGWPQ
NDEALYRKMI APFKEKLERK EVIVVISDAS DQQLIMLRDQ LAHANVGQNP YEMGRQAILT
LHNIVKNLDY DEFIHTPINL CTRENYTSCT QHNL