Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1622 |
Symbol | |
ID | 4252198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 1918076 |
End bp | 1919386 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638118234 |
Product | major facilitator transporter |
Protein accession | YP_733755 |
Protein GI | 113969962 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.227848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATGCC CAGTGCTAAT CTTTGTCTAT CCTGAGAGTG ATGCGTTATC GAGGCCAAAA GTGCAAGCCG TGTTAGACAC TATTATCGAC AGTAAACAAC GTGATACCCG ACTGATGTGG GCCCTGTGTG TGGCTTCCGT GGTGGTGTAT ATCAATCTGT ATTTGATGCA GGGCATGTTA CCGCTGATCG CCGAGCATTT TGCGGTATCG GGCTCTAAGG CAACCCTTAT CCTTTCGGTT ACCAGCTTTT CGCTGGCGTT TTCGCTGTTA ATTTATGCGG TTGTGTCCGA CAGAATTGGC CGCCACACGC CGATTGTCGT GAGTCTCTGG CTACTGGCGC TGTCGAATCT GCTGTTGATT TGGGCTGGGG ATTTTAATGC TCTTGTCTAC GTACGCTTTT TACAGGGCGT GCTGTTAGCG GCGGTGCCCG CCATTGCAAT GGCCTATTTT AAGGAGCAAC TCTCGCCAAG CACTATGCTC AAAGCCGCGG GTATTTATAT CATGGCCAAC AGTATCGGCG GGATTGTCGG TCGGTTACTG GGCGGGGTGA TGTCGCAGTT TTTATCTTGG CAAGAGTCCA TGTGGCTGCT GTTTTTAGTC ACGCTTGCGG GCGTTGCCTT AACCAGTTAT TTATTGCCTT CTGGCGCCGA TGCACAGGCG GTATCGGGCG GACAAACCAC CTCGCCAACA CTGTCAAAAC GGGCACGTTT ATTACAGGAT ATTTATGGCT TTAGCCATCA CCTAACCGAT CCGCAGATGC GTTTAGCCTA TGCCATCGGT GGGATCACTT TTATGATGAT GGTGAATCAA TTTAGCTTTA TTCAGCTGCA TTTGATGGCC GCACCCTACG AGTGGAGCCG TTTCCAAGCG ACGTTGATCT TCCTGTGTTA TTCCAGTGGT ACCGTGGCTT CTTATTTTAC TGCCAAATGG CTGGCCAAAT TTGGTCAGCA CAAGTTATAC CAATGGTCTT GGTGCTTGAT GTTACTGGGC AGTTTATTGA CCCTGTTCGA TACTCCAGTC ACGATTTGTC TGGGCTTTTT GATGACGGCC TGTGGCTTTT TCCTAACCCA CAGCTGCTGC AATTCTTTTG TGGCGATGCG CGCGAGTCGC GACCGCGCTA AAGCCACCTC ACTGTATCTG TGTTGCTATT ACTTAGGCGC CGCGCTGGGC GGGCCTTACT TGATGCTGTT TTGGCATAAA GCCGAGTGGC AGGGGGTAGT GATGGGATCA TTAACACTCC TTGCCTTGAT AGCCTTGTCG ATTGCGCGTT TGCGTTATCA CCAGACCCAA ATGAACCGCG TCGAGGTATA G
|
Protein sequence | MLCPVLIFVY PESDALSRPK VQAVLDTIID SKQRDTRLMW ALCVASVVVY INLYLMQGML PLIAEHFAVS GSKATLILSV TSFSLAFSLL IYAVVSDRIG RHTPIVVSLW LLALSNLLLI WAGDFNALVY VRFLQGVLLA AVPAIAMAYF KEQLSPSTML KAAGIYIMAN SIGGIVGRLL GGVMSQFLSW QESMWLLFLV TLAGVALTSY LLPSGADAQA VSGGQTTSPT LSKRARLLQD IYGFSHHLTD PQMRLAYAIG GITFMMMVNQ FSFIQLHLMA APYEWSRFQA TLIFLCYSSG TVASYFTAKW LAKFGQHKLY QWSWCLMLLG SLLTLFDTPV TICLGFLMTA CGFFLTHSCC NSFVAMRASR DRAKATSLYL CCYYLGAALG GPYLMLFWHK AEWQGVVMGS LTLLALIALS IARLRYHQTQ MNRVEV
|
| |