Gene Smed_3180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3180 
Symbol 
ID5324059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3345216 
End bp3347105 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content62% 
IMG OID640792128 
Productmajor facilitator transporter 
Protein accessionYP_001328839 
Protein GI150398372 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACG TCGTGACAGC GGGCGGCACA AAAGCCGGCC CGATGACCGG CGAGGAGAAG 
AAGGTCATCT TCGCCTCGTC GCTCGGTACC GTCTTCGAAT GGTACGATTT CTATCTTTAC
GGCTCACTTG CCGTTTATAT CGGTGCAACC TTCTTCAGCC AGTATCCGGA GACAACGCGC
AACATCTTCG CGCTGCTCGC CTTCGCCGCC GGCTTCCTGG TCCGGCCGTT CGGCGCACTG
GTTTTCGGCC GCCTGGGCGA TCTCGTCGGG CGCAAATACA CGTTCCTCGT CACGATCCTG
ATCATGGGTC TGTCGACATT CCTCGTCGGC GTTCTGCCCG GTGCGGCCTC GATCGGCATC
GCTGCGCCGA TCATCCTGAT CGCGCTCCGC CTCCTGCAGG GCCTCGCGCT TGGCGGTGAA
TACGGCGGTG CTGCAACCTA TGTTGCAGAG CATGCGCCGC ATGGCCGGCG CGGTTATTTC
ACGTCGTGGA TTCAGACGAC GGCGACGCTC GGTCTCTTCC TCTCGCTGGT CGTTATTCTT
CTGGTCCAAT ATATGCTTGG CAAGGAAGCC TTCGCCGAGT GGGGCTGGCG CATACCGTTC
CTCCTATCCT TCGTGCTTCT TGGCGTTTCC GTCTGGATCC GCCTGAAGAT GAACGAATCG
CCTGCCTTCA AGAAGATGAA GGAGGAAGGC AAGGGTTCGA AGGCCCCGCT GACGGAAGCA
TTCGGCCATT GGCGCAACGC CAAGATCGCC CTTCTGGCTC TCTTCGGCGC TGTCGTCGGC
CAGGCCGTGG TCTGGTATTC CGGCCAATTC TACGCGCTGT TCTTCCTGCA GAGCATACTA
AAGGTCGACG GCCAGTCGGC AAACCTGATG GTCGCCGCCT CGCTCCTGCT CGGCACGGGC
TTTTTCGTCT TCTTCGGCTG GCTGTCGGAC AAGATCGGCC GTAAGCCGAT CATCATGGCG
GGCCTTCTGC TTGCCATGCT GACCTACTTC CCGCTGTTCA AAGCGCTGAC CTGGGCCGGC
AATCCCGCAC TTGCGGAAGC GCAGCAGAGC GTCCGCGCAA CCGTTACCGC GGCTCCGGGC
GACTGCAAGT TCCAGTTCAA CCCGACCGGT ACGGCGAAGT TCACCACGTC GTGCGATATA
GCGACCGCGT TCCTGACCAG GAACTCCGTT CCCTACGACG TCGTGACCTC GGCGGCTGCA
GGGACGCCGG CGACGGTGAA GATCGGCGAA ACGACGATCA CCAGCTATGA CGCGATCGCG
GCCGGCGACA AGGCAAAGGC CGAGGAAGCG GCCTTTGGCA AACAGATCAA CATGGCACTG
CAGTCCTCGG GCTATCCGCT GGTGCGCGGA GCCGCCAAGG TGCCGGAATC GAAGCTCGAC
GCTTTCGTCG CGGCTAATCC GGAGCTTGCA CTCGACGCGG CTGCCGTACG CGCCGGCGAA
AAGACCATGG TCCCGGCCGA CAAGCTCGTT GCAGACAAGC TGCTCACGCA GGAAGAGGTC
GGCAGCGCCA CCGAAATGGC CGTCTATTCT ATAGACAAGG GCGGCGCCTT CACCATGGTG
GCCGACCCGG CTCGCGTCAA CTGGACGGTG ATCATCGCCG TGCTGACCGT GCTCGTCATC
TATGTGACCA TGGTCTACGG CCCAATCGCG GCACTGCTCG TCGAGCTCTT CCCGACACGC
ATCCGTTACA CGGGCATGTC GCTGCCTTAC CATATCGGCA ATGGCTGGTT CGGTGGGCTG
CTTCCGGCAA CGGCCTTCGC GATGAGCGCG GCCAAGGGCG ATATCTACTA CGGCCTGTGG
TATCCGATCG TATTTGCGGG CATCACGCTG GTCATCGGCC TTCTGTTCCT GCCCGAAACG
AAGGACCGGG ATATCCACAC GATGGAGTGA
 
Protein sequence
MANVVTAGGT KAGPMTGEEK KVIFASSLGT VFEWYDFYLY GSLAVYIGAT FFSQYPETTR 
NIFALLAFAA GFLVRPFGAL VFGRLGDLVG RKYTFLVTIL IMGLSTFLVG VLPGAASIGI
AAPIILIALR LLQGLALGGE YGGAATYVAE HAPHGRRGYF TSWIQTTATL GLFLSLVVIL
LVQYMLGKEA FAEWGWRIPF LLSFVLLGVS VWIRLKMNES PAFKKMKEEG KGSKAPLTEA
FGHWRNAKIA LLALFGAVVG QAVVWYSGQF YALFFLQSIL KVDGQSANLM VAASLLLGTG
FFVFFGWLSD KIGRKPIIMA GLLLAMLTYF PLFKALTWAG NPALAEAQQS VRATVTAAPG
DCKFQFNPTG TAKFTTSCDI ATAFLTRNSV PYDVVTSAAA GTPATVKIGE TTITSYDAIA
AGDKAKAEEA AFGKQINMAL QSSGYPLVRG AAKVPESKLD AFVAANPELA LDAAAVRAGE
KTMVPADKLV ADKLLTQEEV GSATEMAVYS IDKGGAFTMV ADPARVNWTV IIAVLTVLVI
YVTMVYGPIA ALLVELFPTR IRYTGMSLPY HIGNGWFGGL LPATAFAMSA AKGDIYYGLW
YPIVFAGITL VIGLLFLPET KDRDIHTME