Gene Smed_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3970 
Symbol 
ID5319068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp420931 
End bp422538 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content65% 
IMG OID640775779 
Productprotein of unknown function DUF894 DitE 
Protein accessionYP_001312712 
Protein GI150376116 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGG CGGGTCTCGA GGGCACGGGC ACATTCGCTC CGTTGCGGCA GCAGGTCTTT 
GCCGTCCTTT GGGTGGCAAC GATCATAGGC AATACGGGCA GCTTCATCCG CGACGTCGCC
AGTTCCTGGC TCGTGACCGA ACTTTCGGCG ACGCCCGCCG CGGTTTCGCT CGTACAGGCC
GCGGCGACGC TTCCCGTTTT CCTGCTCGCC ATCCCGGCCG GAGTGCTTTC GGATATCCTC
GACCGGCGAA AGTTCCTGAT CGCCGTTCAA TTGCTTCTCG CTTCCGCCAG TCTTTGCCTT
CTCACGCTCT CCGCCATGGG GCTTCAGTCC GTCAGCTCGC TCGTTGCGCT TACTTTCATT
GGGGGCATCG GAGCCGCTCT CGTCGCACCA ACATGGCAGG CGATCGTGCC TGAACTCGTG
CAGAGGCAGG ACGTCAAAGG AGCGGTGGCG CTCAACTCGC TGGGCATCAA TATTTCGCGC
GCGATCGGCC CGGCGCTCGG CGGTCTGCTG CTGGCCTGGT TCGGCGCTGC GGTCACCTAC
GGGGTGGATG TCATCACTTA TGTCTTCGTC GTCGCTGCAC TGCTCTGGTG GCGGCGCGGG
ACGCCTGAGG ACGATGCGCT TGCCGAGCGC TTCTTCGGCG CGTTCAGGGC GGGTCTGCGC
TATGCCCGGT CGAGCAGGGA ACTGCACGTC GTCCTCATCC GCGCGGCCGT ATTCTTTGCT
TTTTCGAGCG CCATCTGGGC ATTGCTGCCG CTCGTCGCGC GCCAGCTTCT CGGTGGAGAC
GCCAGCTTCT ATGGCATGCT GCTTGGCGCC GTCGGCGCAG GCGCGGTCTG CGGCGCTCTG
GTCATGCCGC GCCTGCGCGC CAGGCTGGAT GCCGATGGCC TGCTGCTCGC AGCCGCAATC
GTCACCGCCG CTGTCATGGG CGTACTTTCG CTCGCTCCGC CGCAATGGGC GGCCGTCGCG
GCCTTGCTGG CCTGCGGCGC TGCCTGGATC ACGGCGCTGA CGACGTTGAA CAGCACAGCG
CAGTCCATCC TCCCCAATTG GGTGCGGGGA CGCTCGCTTG CCGTCTATCT GACCGTCTTC
AACGGTGCGA TGACCGGGGG AAGCCTTGCC TGGGGCGCGA TCGCCGAAGC AGTTGGCGTC
CCGTCGGCCC TCGGCATCGC CGCGATCGGC CTTCTCTGTG TCGGGCTCGG CTTTCATAGG
GTAAAGCTCC CCAGAGGCGA GGCCGAGCTT GTGCCCTCCA ATCACTGGCC GGAACCGCTG
ACGGCTCAGC CGGTTGAGAA CGACCGCGGC CCCGTGCTCA TCCTGATCGA ATACATCGTC
GACAGGAAGG ACCGGCCCGC CTTTCTGAAA GCGCTTGCAA GCCTCTCGCA CGAGCGCCGC
CGCGATGGGG CCTACGGCTG GGGCGTGACC GAGGATGCCG CCGATCCGAG CCGCGTCGTC
GAATGGTTCA CGGTGGAGTC CTGGGCCGAA CACATGCGGC AGCACAGGCG CGTGTCGAAG
GCGGACGCCG ATGTTCAGCA GGAAGTCCGC GCCTATCATC AGGGGTTGGA GCCTCCCGCC
GTACAGCATC TTTTGGCAAT CAACCGCCCG CATATCAAGG GCAAGTGA
 
Protein sequence
MKAAGLEGTG TFAPLRQQVF AVLWVATIIG NTGSFIRDVA SSWLVTELSA TPAAVSLVQA 
AATLPVFLLA IPAGVLSDIL DRRKFLIAVQ LLLASASLCL LTLSAMGLQS VSSLVALTFI
GGIGAALVAP TWQAIVPELV QRQDVKGAVA LNSLGINISR AIGPALGGLL LAWFGAAVTY
GVDVITYVFV VAALLWWRRG TPEDDALAER FFGAFRAGLR YARSSRELHV VLIRAAVFFA
FSSAIWALLP LVARQLLGGD ASFYGMLLGA VGAGAVCGAL VMPRLRARLD ADGLLLAAAI
VTAAVMGVLS LAPPQWAAVA ALLACGAAWI TALTTLNSTA QSILPNWVRG RSLAVYLTVF
NGAMTGGSLA WGAIAEAVGV PSALGIAAIG LLCVGLGFHR VKLPRGEAEL VPSNHWPEPL
TAQPVENDRG PVLILIEYIV DRKDRPAFLK ALASLSHERR RDGAYGWGVT EDAADPSRVV
EWFTVESWAE HMRQHRRVSK ADADVQQEVR AYHQGLEPPA VQHLLAINRP HIKGK