Gene Shewmr4_1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1212 
Symbol 
ID4251251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1413003 
End bp1414496 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content48% 
IMG OID638117797 
Productextracellular solute-binding protein 
Protein accessionYP_733349 
Protein GI113969556 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACTTA TTAGCATGTT AACACTTGTA TCCGAGGGGT TATTCTTGTT AAAACCTTTG 
TCACTCTTAC TTGTCGGACT GGGTTTCACG CTAGCAGCAC CTGCCTTAGC GGAATTTTCC
TTGGCACAAA TTCCCCCCCT CACGAACAAA ACGCCCATCA CCTTAGTCGC CGAAAAAGGC
TTTTGGACTG ACTACCTAAC GCAGGAAATG TTACCCAAGT TCACCGAAAA AACCGGCGTT
AAAGTCAATG TGGTCAGTAC CGAGCTGGAA GGCATGTTCG AACTACAGAC CCATGCATTG
CTGCAAGGCG AAGGAAAATA TGATCTGCTA ACCATGGAAG CCGGCTGGGC AAAAGAGTGG
GCCGCCAATG GCTATACCGT CCCCCTATTA GAACTCGCAA AACTCTACGA TCCCGACGGT
GAAAAAGCGA TGGAAAGCTA TCTCGAACCA TATTACCCAT CCTTGCTGAA TATTTTGTCC
TATCACGGTG AGTTACACGC TATTCCCTAC AATAACTATG TGATGGGGAG TCATTACCGC
GCGGATTTAT TTGAGAATAA AACCGAACAA GCTCAGTTTC AGCAACGCTT TGGTTACCCA
CTCAAACCCG CGACTAATTT CGATGAACTA CAGGACCAAG CGGTATTTTT TACCCGAAAA
GCAGGCGAAC TGCTTGCGGG CAAGCCCTTA AGCCACGACT TTTATGGCTT GGCATTAATG
TCTGGCAACA AGCCCCATAT TAACGATGAA TTTTCGAGCA TCATTTGGAG CTTAGGGGGC
GCTTGGATGT GGCCCGTCTA TAATCCGCAA AAGCAGATCA CCCACTTCGA AGTGCCCGCA
GTCAACCAAG AGGCGATAAA GGCAAGCGCT ATCTATCGAA AACTAATGCC TTACGCCATT
CCGGCGGATG ATAAATTTGC CTTCAATGAA GCGGCCAATG CACTGGCAAG CGGACAAGTG
GCGATTTGGC CCTTTGCCTA CAACAATCTC TGGTCGGTCT CCTTTAAAGT CGAGGCCAAT
GTGCCAGGCG CTCGGCTTGG CGTGGCTCAA GTGCCTGGCG GCATGCCCTA TAACGGTGCC
TACGCTTTTG CCGTGAGCTA CGACAGTAAA AATCCGCAAG CCGCCTATTG GTTACTCAAA
TACATGGGCA CCTATGAAGC CCAATATGCC TACGCCCTCG GCGGCGGCAA TCCCTGCAGA
ATGGATGTGG TCACAGCGCC AGAATTTAAA CAAGCCTCTA AGAGATCCAT TGCTGGCGCC
TTTGAGGCGA GCCAAATTGC TAACTTATTT TGGTCTACTA AAGTATTGGA AGTAGGACAC
TTTACCACTA CCGCTATGGG GCAGATTTAT CCCGAATTAA GTCATGCTTG TTATGCAGCA
AGCCGACACG AAGCCAACAG TACCGATATT TTTATTGAGT TAAGCAGCAA AATTAAACAC
CTGCAGAATA CCTATGGGGA AGTGCCTGCA ATTGATAAGA AAAATAACCA ATGA
 
Protein sequence
MLLISMLTLV SEGLFLLKPL SLLLVGLGFT LAAPALAEFS LAQIPPLTNK TPITLVAEKG 
FWTDYLTQEM LPKFTEKTGV KVNVVSTELE GMFELQTHAL LQGEGKYDLL TMEAGWAKEW
AANGYTVPLL ELAKLYDPDG EKAMESYLEP YYPSLLNILS YHGELHAIPY NNYVMGSHYR
ADLFENKTEQ AQFQQRFGYP LKPATNFDEL QDQAVFFTRK AGELLAGKPL SHDFYGLALM
SGNKPHINDE FSSIIWSLGG AWMWPVYNPQ KQITHFEVPA VNQEAIKASA IYRKLMPYAI
PADDKFAFNE AANALASGQV AIWPFAYNNL WSVSFKVEAN VPGARLGVAQ VPGGMPYNGA
YAFAVSYDSK NPQAAYWLLK YMGTYEAQYA YALGGGNPCR MDVVTAPEFK QASKRSIAGA
FEASQIANLF WSTKVLEVGH FTTTAMGQIY PELSHACYAA SRHEANSTDI FIELSSKIKH
LQNTYGEVPA IDKKNNQ