Gene Smed_3437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3437 
Symbol 
ID5324323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3641836 
End bp3643491 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content64% 
IMG OID640792387 
Productprotein of unknown function DUF894 DitE 
Protein accessionYP_001329090 
Protein GI150398623 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.545505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA ACCGAAAGAG ATCAGAGGTG ACGAACCGCA CATCGCCGTT GGCTCCCTTC 
AGGCACGACA TCTTCCGCAC GATCTGGATC GCGAGCCTTG CTTCCAATTT CGGCGGGCTG
ATCCAGGCCG TGGGTGCGGC CTGGCTCATG ACGTCCATTT CGCAATCGGT GAACATGGTG
GCGCTGGTGC AGGCCTCGAC CTCGCTGCCG ATCATGCTCT TCTCACTCGT TTCCGGCGCG
CTTGCCGACA ATTTCGACCG GCGGCGGATC ATGCTCGTCG CTCAGAGCTT CATGCTCGCC
GTTTCGGGGC TGCTTACCGT CTGCGCCTAT TACGGTATCG TTACGCCGTG GCTTCTGCTC
ATCTTCACCT TTCTTCTCGG CTGCGGCACG GCTTTGAACA ATCCATCCTG GCAGGCTTCG
GTCGGCGACA TGGTTCCGCG TGACGACCTG CCGGCAGCGG TGGCGCTGAA CAGCATGGGA
TTCAATCTGA CCCGCAGCGT CGGCCCGGCG ATCGGCGGCG CGATCGTCGC AGCCGCGGGT
GCCGCGGCCG CCTTCGCCGC CAACACACTC AGCTATTTCG CGATCCTGTT CGCGCTTGCC
CGATGGAAAC CGGTAACCCC GGAAAACCGG CTGCCGCGCG AGACGCTCGG GCGCGCCGTT
TCCGCGGGCC TGCGCTACGT GGCGATGTCG CCCAATATCG GCAAGGTGCT CGTGCGCGGC
TTCGCCTTCG GCCTTTCGGC GAGCGCCATT CTCGCCCTGC TGCCGCTGGT GGCGCGCGAC
CTCGTCGGCG GCGGGCCGCT CACTTACGGC GTCATGCTCG GCGCCTTCGG CCTCGGCGCG
ATCGGCGGCG CGCTTTTGAG CGCAAGGCTG AGGGAATTCC TCACGAGCGA GGCGATCGTG
CGTTATGCCT TTGCCGGCTT CGCCTTCAGC GCATTGGTCA CAGCCATTAG TTCGGAAGCC
TGGCTGACCT GTCTCGTGCT CGCTGTTTCC GGCGCGTGCT GGGTGCTGGC GCTTTCGCTC
TTCAACACCA CGGTGCAGCT TTCGACACCG CGCTGGGTCG TCGGCCGGGC GCTTTCGCTC
TATCAGACGA TGACCTTCGG CGGGATCGCC GGCGGCAGTT GGCTGTGGGG TGTCACCGCC
GAACAATACG GCGCAGCCAA CGCGCTCATC GGCTCCTGTC TTCTGATGCT CGTGGGGGCG
GCGATCGGAC TGCGCTTCGC CCTGCCGGAG TTCAAGTCGC TCAACCTCGA CCCGCTCAAC
CGCTTCAACG AACCGCTGCT CGAACTCGAC CTGAAGCCGC GCAGCGGGCC GATCGTCGTC
ATGATCGATT ACGATATCGC CGATAACGAC ATACCCGAAT TCCTGAAGAC CATGGCCGAG
CGGCGGCGCA TCCGCATCCG TGACGGGGCC GGGCATTGGG CGCTCATGCG CGACCTCGAA
AACCCGACGA CCTGGACCGA GACTTATCAC GTGCCGACCT GGGTCGAATA TGTCCGCCAC
AATCAGCGCC GTACCCAGGC CGACGCCGCC ATTGGGGACA AGCTGACCGC ACTCCATCGG
GGACCGAACC CGCCGCGGGT GCACCGCATG ATCGAGCGGC AGACGATCGT TCCCGACCAT
TACGAGCGCT ACAAGCGATC CGTCGAGATG CACTGA
 
Protein sequence
MKINRKRSEV TNRTSPLAPF RHDIFRTIWI ASLASNFGGL IQAVGAAWLM TSISQSVNMV 
ALVQASTSLP IMLFSLVSGA LADNFDRRRI MLVAQSFMLA VSGLLTVCAY YGIVTPWLLL
IFTFLLGCGT ALNNPSWQAS VGDMVPRDDL PAAVALNSMG FNLTRSVGPA IGGAIVAAAG
AAAAFAANTL SYFAILFALA RWKPVTPENR LPRETLGRAV SAGLRYVAMS PNIGKVLVRG
FAFGLSASAI LALLPLVARD LVGGGPLTYG VMLGAFGLGA IGGALLSARL REFLTSEAIV
RYAFAGFAFS ALVTAISSEA WLTCLVLAVS GACWVLALSL FNTTVQLSTP RWVVGRALSL
YQTMTFGGIA GGSWLWGVTA EQYGAANALI GSCLLMLVGA AIGLRFALPE FKSLNLDPLN
RFNEPLLELD LKPRSGPIVV MIDYDIADND IPEFLKTMAE RRRIRIRDGA GHWALMRDLE
NPTTWTETYH VPTWVEYVRH NQRRTQADAA IGDKLTALHR GPNPPRVHRM IERQTIVPDH
YERYKRSVEM H