Gene Shewmr4_3012 details


Gene Information

Locus tag: Shewmr4_3012
Symbol:
ID: 4253583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3597813
End bp: 3598820
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 47%
IMG OID: 638119654
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_735140
Protein GI: 113971347
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4150] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.615219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0264263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGTAA AATTGACTAA GGCCCTACTG GGAACCCTAT TGCTGGGAAC CACCCTCAAT 
GTGGCAGCCG CCGATCAGAC ACTATTAAAT TCCTCGTACG ATATCGCGCG GGAATTATTC
AATGCCTATA ACCCTGTTTT TGCTAAACAT TGGCAAGAAA AAACTGGCAA GACAGTTGAA
ATCAAGCAAT CCCATGCGGG TTCTTCCGCC CAGGCTCGCT CGATTCTTCA GGGCTTACCC
GCCGATGTCG TGACCTTTAA CCAAGTCACC GATGTGCAAA TTCTCCATGA CCGCGGCAAG
TTGATCCCAG AAAACTGGCA ACAACTGCTG CCGAATGCCA GCTCACCTTA CTACTCCACC
ATCGCGTTTT TGGTGCGTAA GGGCAATCCA AAGCAAATCA GCGACTGGAA TGATTTAGCC
AAAGATGACG TTAAGTTGGT GTTCCCGAAC CCAAAAACCT CGGGTAATGC GCGTTACACT
TACTTAGCAG CCTTAGGTTA TGCCCAGAAA AACTATGGTA AAGATAATCA AGCCTCATTA
GATGAGTTTT TGAAGAAGTT CCTCGGTAAC GTTGCCGTAT TTGATACGGG CGGCCGTGGT
GCAACCACGT CGTTTGTTGA ACGTGGCATC GGCGATGTGC TGATCACCTT CGAATCTGAA
GTTAACAATA TTCGTCAGCA ATATGGTGCC GATGATTACC AAGTCGTCGT GCCAAAAACC
TCGATTCTGG CAGAATTCCC CGTTGCCGTA GTTGAGAAAA ACGCTAAGCG TAACGGCACG
CAAGAACTCG CAACCGAGTA TTTAAACTAC CTTTACAGCG AAGAAGCACA ACGTTTGTTA
GCCGGATTTA ACTACCGCGT TCACAACGAG AAAGTCGTTG CCGAATTTAC CAAGCAATTC
CCAACGGTTG AATTAATGAC CGTTGAGCAA ATCATAGGTA ACTGGGACAA CGCGATGAAG
ACCCAATTTG CCAATGGCGC CAAACTGGAT CAGTTATTAA AACGCTGA
 
Protein sequence
MPVKLTKALL GTLLLGTTLN VAAADQTLLN SSYDIARELF NAYNPVFAKH WQEKTGKTVE 
IKQSHAGSSA QARSILQGLP ADVVTFNQVT DVQILHDRGK LIPENWQQLL PNASSPYYST
IAFLVRKGNP KQISDWNDLA KDDVKLVFPN PKTSGNARYT YLAALGYAQK NYGKDNQASL
DEFLKKFLGN VAVFDTGGRG ATTSFVERGI GDVLITFESE VNNIRQQYGA DDYQVVVPKT
SILAEFPVAV VEKNAKRNGT QELATEYLNY LYSEEAQRLL AGFNYRVHNE KVVAEFTKQF
PTVELMTVEQ IIGNWDNAMK TQFANGAKLD QLLKR
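
Consistency check (illustrative)

The coordinates, gene length, and protein length listed in this record are related by simple arithmetic: the gene spans End bp - Start bp + 1 bases, and the coding sequence ends in a stop codon (TGA here) that is not counted in the protein. The following is a minimal Python sketch of that check, plus a GC-percentage helper. The function names (clean, coding_length, expected_protein_length, gc_percent) are hypothetical and are not part of the database this record comes from. The sketch assumes an unspliced CDS with inclusive 1-based coordinates.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def coding_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Values taken from the record above.
gene_length = coding_length(3597813, 3598820)
print(gene_length, expected_protein_length(gene_length))  # 1008 335
```

To check the reported GC content, pass the full gene sequence as printed above (spaces and line breaks included) to gc_percent.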