Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3970 |
Symbol | |
ID | 5319068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 420931 |
End bp | 422538 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640775779 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_001312712 |
Protein GI | 150376116 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGG CGGGTCTCGA GGGCACGGGC ACATTCGCTC CGTTGCGGCA GCAGGTCTTT GCCGTCCTTT GGGTGGCAAC GATCATAGGC AATACGGGCA GCTTCATCCG CGACGTCGCC AGTTCCTGGC TCGTGACCGA ACTTTCGGCG ACGCCCGCCG CGGTTTCGCT CGTACAGGCC GCGGCGACGC TTCCCGTTTT CCTGCTCGCC ATCCCGGCCG GAGTGCTTTC GGATATCCTC GACCGGCGAA AGTTCCTGAT CGCCGTTCAA TTGCTTCTCG CTTCCGCCAG TCTTTGCCTT CTCACGCTCT CCGCCATGGG GCTTCAGTCC GTCAGCTCGC TCGTTGCGCT TACTTTCATT GGGGGCATCG GAGCCGCTCT CGTCGCACCA ACATGGCAGG CGATCGTGCC TGAACTCGTG CAGAGGCAGG ACGTCAAAGG AGCGGTGGCG CTCAACTCGC TGGGCATCAA TATTTCGCGC GCGATCGGCC CGGCGCTCGG CGGTCTGCTG CTGGCCTGGT TCGGCGCTGC GGTCACCTAC GGGGTGGATG TCATCACTTA TGTCTTCGTC GTCGCTGCAC TGCTCTGGTG GCGGCGCGGG ACGCCTGAGG ACGATGCGCT TGCCGAGCGC TTCTTCGGCG CGTTCAGGGC GGGTCTGCGC TATGCCCGGT CGAGCAGGGA ACTGCACGTC GTCCTCATCC GCGCGGCCGT ATTCTTTGCT TTTTCGAGCG CCATCTGGGC ATTGCTGCCG CTCGTCGCGC GCCAGCTTCT CGGTGGAGAC GCCAGCTTCT ATGGCATGCT GCTTGGCGCC GTCGGCGCAG GCGCGGTCTG CGGCGCTCTG GTCATGCCGC GCCTGCGCGC CAGGCTGGAT GCCGATGGCC TGCTGCTCGC AGCCGCAATC GTCACCGCCG CTGTCATGGG CGTACTTTCG CTCGCTCCGC CGCAATGGGC GGCCGTCGCG GCCTTGCTGG CCTGCGGCGC TGCCTGGATC ACGGCGCTGA CGACGTTGAA CAGCACAGCG CAGTCCATCC TCCCCAATTG GGTGCGGGGA CGCTCGCTTG CCGTCTATCT GACCGTCTTC AACGGTGCGA TGACCGGGGG AAGCCTTGCC TGGGGCGCGA TCGCCGAAGC AGTTGGCGTC CCGTCGGCCC TCGGCATCGC CGCGATCGGC CTTCTCTGTG TCGGGCTCGG CTTTCATAGG GTAAAGCTCC CCAGAGGCGA GGCCGAGCTT GTGCCCTCCA ATCACTGGCC GGAACCGCTG ACGGCTCAGC CGGTTGAGAA CGACCGCGGC CCCGTGCTCA TCCTGATCGA ATACATCGTC GACAGGAAGG ACCGGCCCGC CTTTCTGAAA GCGCTTGCAA GCCTCTCGCA CGAGCGCCGC CGCGATGGGG CCTACGGCTG GGGCGTGACC GAGGATGCCG CCGATCCGAG CCGCGTCGTC GAATGGTTCA CGGTGGAGTC CTGGGCCGAA CACATGCGGC AGCACAGGCG CGTGTCGAAG GCGGACGCCG ATGTTCAGCA GGAAGTCCGC GCCTATCATC AGGGGTTGGA GCCTCCCGCC GTACAGCATC TTTTGGCAAT CAACCGCCCG CATATCAAGG GCAAGTGA
|
Protein sequence | MKAAGLEGTG TFAPLRQQVF AVLWVATIIG NTGSFIRDVA SSWLVTELSA TPAAVSLVQA AATLPVFLLA IPAGVLSDIL DRRKFLIAVQ LLLASASLCL LTLSAMGLQS VSSLVALTFI GGIGAALVAP TWQAIVPELV QRQDVKGAVA LNSLGINISR AIGPALGGLL LAWFGAAVTY GVDVITYVFV VAALLWWRRG TPEDDALAER FFGAFRAGLR YARSSRELHV VLIRAAVFFA FSSAIWALLP LVARQLLGGD ASFYGMLLGA VGAGAVCGAL VMPRLRARLD ADGLLLAAAI VTAAVMGVLS LAPPQWAAVA ALLACGAAWI TALTTLNSTA QSILPNWVRG RSLAVYLTVF NGAMTGGSLA WGAIAEAVGV PSALGIAAIG LLCVGLGFHR VKLPRGEAEL VPSNHWPEPL TAQPVENDRG PVLILIEYIV DRKDRPAFLK ALASLSHERR RDGAYGWGVT EDAADPSRVV EWFTVESWAE HMRQHRRVSK ADADVQQEVR AYHQGLEPPA VQHLLAINRP HIKGK
|
| |