Gene Shewmr4_1975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1975 
Symbol 
ID4252548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2348667 
End bp2349938 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content46% 
IMG OID638118587 
Productinner membrane transport protein YdhC 
Protein accessionYP_734105 
Protein GI113970312 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00432346 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.238152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGT CTAGTAATAT CTTTGTAAAT ATGAAGTTTT TTATATTCTT ATTCTATTTG 
GCATTATTAA GCATGTTAGG CTTTATTGCC ACTGACATGT ATTTGCCTGC TTTCAAAGCA
ATTGAAAGTT CGTTCAATTC TTCACCGTCT CAAGTAGCAA TGTCGCTCAC CTGTTTTTTG
GCTGGTTTAG CCTTAGGGCA ACTGATTTAT GGCCCCTTGG TCAGTAAACT CGGCAAACGT
TATGCTCTTA TCCTCGGCCT TGGCATTTTT GCGCTCGCCA GTGTGGCCAT CGCCAATAGC
GACTCGATAC TGATGTTAAA CATCGCTCGC TTCTTCCAAG CCGTTGGCGC CTGTAGTGCA
GGGGTCATCT GGCAAGCGAT TGTGGTCGAG CAATATGATG CCGAAAAAGC GCAGGGGATT
TTCAGTAACA TTATGCCGTT AGTGGCATTA TCACCCGCAT TAGCCCCCAT CCTTGGCGCT
TATATTCTGA ACGATTTTGG ATGGCGTGCA ATCTTTATCT CATTGTGTGT GATTGCCTTT
TTATTGGTGT TGATGACCTT ATACTTCGTG CCGAGCCATG CAGAGCATCA GGATGCTAAG
CCAAGCGCGG TTTCCTACGG CAAGATTTTG AAAAATACCC GTTACCTTGG CAATGTGGTG
ATTTTTGGTG CCTGTTCGGG TGCGTTTTTC GCATATCTTA CTGTATGGCC GATTGTGATG
GAGCAACACG GCTATCAGGC AACAGAGATT GGGCTGAGCT TTATTCCGCA AACCATCATG
TTTATTGTGG GCGGATACGC AAGCAAGTTA TTGATAAAAC GCATTGGTGC CGACCGTACA
CTCAACGTAT TGCTGTCCAT TTTTGGACTC TGCGTTATCT CGATTGTGTT TTTCACCTTA
TTAATGAAGG CGGAAACCAT TTTCCCACTG CTGATTTCCT TCTCGATACT CGCAGCGGCG
AACGGGGCGG TTTATCCCAT TGTGGTGAAC AGTGCTTTGC AGCAATTCAC TCAAAATGCG
GCTAAGGCGG CAGGATTACA GAACTTTTTG CAAATCACCA TCGCCTTTGG CGCCTCAAGT
TTAGTCGCAC TCTGGGCAAG TTCAGGAGAA GTCGCCATAG GTTGGGGCAT TCTGAGCTGT
TCATTAGTGG TGATCTTGGG TTACCTGTTA AAAACCGAAC AAACTTGGGC TGATTTTGCT
AAACACTTTA CTGCGCCAGA TCCTGCTCGT CTTGGGATCA ATGCAGATAC GAAGCAAAAT
CAAGCAGATT GA
 
Protein sequence
MKTSSNIFVN MKFFIFLFYL ALLSMLGFIA TDMYLPAFKA IESSFNSSPS QVAMSLTCFL 
AGLALGQLIY GPLVSKLGKR YALILGLGIF ALASVAIANS DSILMLNIAR FFQAVGACSA
GVIWQAIVVE QYDAEKAQGI FSNIMPLVAL SPALAPILGA YILNDFGWRA IFISLCVIAF
LLVLMTLYFV PSHAEHQDAK PSAVSYGKIL KNTRYLGNVV IFGACSGAFF AYLTVWPIVM
EQHGYQATEI GLSFIPQTIM FIVGGYASKL LIKRIGADRT LNVLLSIFGL CVISIVFFTL
LMKAETIFPL LISFSILAAA NGAVYPIVVN SALQQFTQNA AKAAGLQNFL QITIAFGASS
LVALWASSGE VAIGWGILSC SLVVILGYLL KTEQTWADFA KHFTAPDPAR LGINADTKQN
QAD