Gene Shewmr4_3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3420 
Symbol 
ID4253986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4088261 
End bp4089628 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content51% 
IMG OID638120058 
Productmajor facilitator transporter 
Protein accessionYP_735543 
Protein GI113971750 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000601324 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAACA ACGGACTTTC AGGAACCGAG AAAAAAGTCG CGTTTTCTTT AGCCAGCGTA 
TTTGGTTTAC GTATGATGGG CTTGTTTATG ATCATGCCCG TCTTTGCACT CTATGGTCAG
CATTTAGAAG GTTTTTCTCC CCTTTGGGTG GGGATTGCCA TTGGTGCCTA TGGTTTGACT
CAGGCCGTTT TGCAGATCCC TATGGGGATT TTATCAGACA AGTATGGTCG TAAACCTGTC
ATTCTCGCTG GCCTAGTATT GTTCGCCATC GGCAGCTTAA TTGCCGCCAA TGCCGATACG
ATTTACGGGG TGGTATTTGG CCGTGCCGTG CAAGGTATGG GGGCGATTGC CGCCGCCGTG
TTGGCCTTAG CGGCGGACTT AACCCGAGAT GAGCAGCGCA CTAAGGTGAT GGCCATTATC
GGCATGTGTA TCGGCGGCTC CTTTGCCCTG TCGTTACTCG TGGGCCCAAT TGTGGCGCAG
CATTTAGGCT TATCGGGTTT ATTCCTGCTA ACCGCTGGCC TTGCGGTACT TGGCATGTTG
ATTGTGCAGT TATTAGTGCC GAATCCGATT TCCCATGCAC CTAAGGGCGA TACCTTAGCC
GCGCCCGCCA AGCTCAAGCG TATGTTGACC GATCCGCAGC TGTTTAGGCT CGATGCGGGT
ATTTTTATTC TGCACTTAGT GTTAACCGCG GTGTTTGTCG CCTTGCCACT CGATTTAGTC
GATGCGGGTC TGGTGAAAGA AAAACATTGG ATGCTGTATT TCCCCGCGTT TGTGGGCGCA
TTCTTCTTGA TGGTGCCGTT AATCATCATT GGGGTGAAGC GTAAGAATAC TAAGGCGATG
TTCCAAATTG CTTTAGTGAT CATGATGTTT GCCCTTGCGG CCATGGCGAT GTTTTCGAAC
AACCTCTGGG TATTGAGCGT GGCAGTGCTG CTGTTTTTTA CCGGCTTTAA CTACCTTGAG
GCGTCGCTGC CGAGTTTGAT TGCGAAATTC TGCCCCGTTG GTGAAAAAGG CTCGGCCATG
GGGGTCTATT CAACTAGCCA ATTCTTAGGC GCATTCTGTG GAGGTATGCT CGGCGGCGGT
GCGTTCCAAT TAGTCGGCGC CGTGGGCGTG TTTATCGTCG CGCTGGTATT GATGGGCGTT
TGGTTATTAT TGACCCTAGG GATGAAAAAT CCTGTGCTGC TTAAGAGTTA CACCTTAGAG
GCTGCCGTAA AAGATAAGGC CCAAGCGCGG GATATGGCGT CGCAGCTGTC ACAATTGATG
GGTGTGGTCG AAGCGATTGT GGTGCTAGAT GAGAAAGTTG CTTATCTCAA AGTTGACGAG
CATTTCGATT TAAGAGAAGC CCGAGCTGTG TTAGGCTCTG CTCAATAA
 
Protein sequence
MGNNGLSGTE KKVAFSLASV FGLRMMGLFM IMPVFALYGQ HLEGFSPLWV GIAIGAYGLT 
QAVLQIPMGI LSDKYGRKPV ILAGLVLFAI GSLIAANADT IYGVVFGRAV QGMGAIAAAV
LALAADLTRD EQRTKVMAII GMCIGGSFAL SLLVGPIVAQ HLGLSGLFLL TAGLAVLGML
IVQLLVPNPI SHAPKGDTLA APAKLKRMLT DPQLFRLDAG IFILHLVLTA VFVALPLDLV
DAGLVKEKHW MLYFPAFVGA FFLMVPLIII GVKRKNTKAM FQIALVIMMF ALAAMAMFSN
NLWVLSVAVL LFFTGFNYLE ASLPSLIAKF CPVGEKGSAM GVYSTSQFLG AFCGGMLGGG
AFQLVGAVGV FIVALVLMGV WLLLTLGMKN PVLLKSYTLE AAVKDKAQAR DMASQLSQLM
GVVEAIVVLD EKVAYLKVDE HFDLREARAV LGSAQ