Gene Shewmr4_1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1622 
Symbol 
ID4252198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1918076 
End bp1919386 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content50% 
IMG OID638118234 
Productmajor facilitator transporter 
Protein accessionYP_733755 
Protein GI113969962 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.227848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATGCC CAGTGCTAAT CTTTGTCTAT CCTGAGAGTG ATGCGTTATC GAGGCCAAAA 
GTGCAAGCCG TGTTAGACAC TATTATCGAC AGTAAACAAC GTGATACCCG ACTGATGTGG
GCCCTGTGTG TGGCTTCCGT GGTGGTGTAT ATCAATCTGT ATTTGATGCA GGGCATGTTA
CCGCTGATCG CCGAGCATTT TGCGGTATCG GGCTCTAAGG CAACCCTTAT CCTTTCGGTT
ACCAGCTTTT CGCTGGCGTT TTCGCTGTTA ATTTATGCGG TTGTGTCCGA CAGAATTGGC
CGCCACACGC CGATTGTCGT GAGTCTCTGG CTACTGGCGC TGTCGAATCT GCTGTTGATT
TGGGCTGGGG ATTTTAATGC TCTTGTCTAC GTACGCTTTT TACAGGGCGT GCTGTTAGCG
GCGGTGCCCG CCATTGCAAT GGCCTATTTT AAGGAGCAAC TCTCGCCAAG CACTATGCTC
AAAGCCGCGG GTATTTATAT CATGGCCAAC AGTATCGGCG GGATTGTCGG TCGGTTACTG
GGCGGGGTGA TGTCGCAGTT TTTATCTTGG CAAGAGTCCA TGTGGCTGCT GTTTTTAGTC
ACGCTTGCGG GCGTTGCCTT AACCAGTTAT TTATTGCCTT CTGGCGCCGA TGCACAGGCG
GTATCGGGCG GACAAACCAC CTCGCCAACA CTGTCAAAAC GGGCACGTTT ATTACAGGAT
ATTTATGGCT TTAGCCATCA CCTAACCGAT CCGCAGATGC GTTTAGCCTA TGCCATCGGT
GGGATCACTT TTATGATGAT GGTGAATCAA TTTAGCTTTA TTCAGCTGCA TTTGATGGCC
GCACCCTACG AGTGGAGCCG TTTCCAAGCG ACGTTGATCT TCCTGTGTTA TTCCAGTGGT
ACCGTGGCTT CTTATTTTAC TGCCAAATGG CTGGCCAAAT TTGGTCAGCA CAAGTTATAC
CAATGGTCTT GGTGCTTGAT GTTACTGGGC AGTTTATTGA CCCTGTTCGA TACTCCAGTC
ACGATTTGTC TGGGCTTTTT GATGACGGCC TGTGGCTTTT TCCTAACCCA CAGCTGCTGC
AATTCTTTTG TGGCGATGCG CGCGAGTCGC GACCGCGCTA AAGCCACCTC ACTGTATCTG
TGTTGCTATT ACTTAGGCGC CGCGCTGGGC GGGCCTTACT TGATGCTGTT TTGGCATAAA
GCCGAGTGGC AGGGGGTAGT GATGGGATCA TTAACACTCC TTGCCTTGAT AGCCTTGTCG
ATTGCGCGTT TGCGTTATCA CCAGACCCAA ATGAACCGCG TCGAGGTATA G
 
Protein sequence
MLCPVLIFVY PESDALSRPK VQAVLDTIID SKQRDTRLMW ALCVASVVVY INLYLMQGML 
PLIAEHFAVS GSKATLILSV TSFSLAFSLL IYAVVSDRIG RHTPIVVSLW LLALSNLLLI
WAGDFNALVY VRFLQGVLLA AVPAIAMAYF KEQLSPSTML KAAGIYIMAN SIGGIVGRLL
GGVMSQFLSW QESMWLLFLV TLAGVALTSY LLPSGADAQA VSGGQTTSPT LSKRARLLQD
IYGFSHHLTD PQMRLAYAIG GITFMMMVNQ FSFIQLHLMA APYEWSRFQA TLIFLCYSSG
TVASYFTAKW LAKFGQHKLY QWSWCLMLLG SLLTLFDTPV TICLGFLMTA CGFFLTHSCC
NSFVAMRASR DRAKATSLYL CCYYLGAALG GPYLMLFWHK AEWQGVVMGS LTLLALIALS
IARLRYHQTQ MNRVEV