Gene Information |
Locus tag | Shewmr4_3012 |
Symbol | |
ID | 4253583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 3597813 |
End bp | 3598820 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638119654 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_735140 |
Protein GI | 113971347 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
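
The coordinate and length fields above are tied together by simple arithmetic. The gene length equals End bp minus Start bp plus one, and an unspliced CDS of that length encodes one residue per codon, less the stop codon. The sketch below checks this for the values in this record. It assumes 1-based inclusive coordinates, and the function names are illustrative rather than part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(3597813, 3598820)
print(length, protein_length(length))  # 1008 335
```

Both results agree with the Gene Length (1008 bp) and Protein Length (335 aa) fields.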
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.615219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0264263 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTAA AATTGACTAA GGCCCTACTG GGAACCCTAT TGCTGGGAAC CACCCTCAAT GTGGCAGCCG CCGATCAGAC ACTATTAAAT TCCTCGTACG ATATCGCGCG GGAATTATTC AATGCCTATA ACCCTGTTTT TGCTAAACAT TGGCAAGAAA AAACTGGCAA GACAGTTGAA ATCAAGCAAT CCCATGCGGG TTCTTCCGCC CAGGCTCGCT CGATTCTTCA GGGCTTACCC GCCGATGTCG TGACCTTTAA CCAAGTCACC GATGTGCAAA TTCTCCATGA CCGCGGCAAG TTGATCCCAG AAAACTGGCA ACAACTGCTG CCGAATGCCA GCTCACCTTA CTACTCCACC ATCGCGTTTT TGGTGCGTAA GGGCAATCCA AAGCAAATCA GCGACTGGAA TGATTTAGCC AAAGATGACG TTAAGTTGGT GTTCCCGAAC CCAAAAACCT CGGGTAATGC GCGTTACACT TACTTAGCAG CCTTAGGTTA TGCCCAGAAA AACTATGGTA AAGATAATCA AGCCTCATTA GATGAGTTTT TGAAGAAGTT CCTCGGTAAC GTTGCCGTAT TTGATACGGG CGGCCGTGGT GCAACCACGT CGTTTGTTGA ACGTGGCATC GGCGATGTGC TGATCACCTT CGAATCTGAA GTTAACAATA TTCGTCAGCA ATATGGTGCC GATGATTACC AAGTCGTCGT GCCAAAAACC TCGATTCTGG CAGAATTCCC CGTTGCCGTA GTTGAGAAAA ACGCTAAGCG TAACGGCACG CAAGAACTCG CAACCGAGTA TTTAAACTAC CTTTACAGCG AAGAAGCACA ACGTTTGTTA GCCGGATTTA ACTACCGCGT TCACAACGAG AAAGTCGTTG CCGAATTTAC CAAGCAATTC CCAACGGTTG AATTAATGAC CGTTGAGCAA ATCATAGGTA ACTGGGACAA CGCGATGAAG ACCCAATTTG CCAATGGCGC CAAACTGGAT CAGTTATTAA AACGCTGA
|
Protein sequence | MPVKLTKALL GTLLLGTTLN VAAADQTLLN SSYDIARELF NAYNPVFAKH WQEKTGKTVE IKQSHAGSSA QARSILQGLP ADVVTFNQVT DVQILHDRGK LIPENWQQLL PNASSPYYST IAFLVRKGNP KQISDWNDLA KDDVKLVFPN PKTSGNARYT YLAALGYAQK NYGKDNQASL DEFLKKFLGN VAVFDTGGRG ATTSFVERGI GDVLITFESE VNNIRQQYGA DDYQVVVPKT SILAEFPVAV VEKNAKRNGT QELATEYLNY LYSEEAQRLL AGFNYRVHNE KVVAEFTKQF PTVELMTVEQ IIGNWDNAMK TQFANGAKLD QLLKR
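
The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be stripped before any computation. The sketch below shows one way to clean such a sequence, compute its GC fraction, and translate it. The helper names are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here).

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons enumerated in TCAG order, paired with the standard amino acid string.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
print(translate("ATGCCAGTAA AATTGACTAA GGCCCTACTG"))  # MPVKLTKALL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.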