Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3437 |
Symbol | |
ID | 5324323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3641836 |
End bp | 3643491 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640792387 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_001329090 |
Protein GI | 150398623 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.545505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA ACCGAAAGAG ATCAGAGGTG ACGAACCGCA CATCGCCGTT GGCTCCCTTC AGGCACGACA TCTTCCGCAC GATCTGGATC GCGAGCCTTG CTTCCAATTT CGGCGGGCTG ATCCAGGCCG TGGGTGCGGC CTGGCTCATG ACGTCCATTT CGCAATCGGT GAACATGGTG GCGCTGGTGC AGGCCTCGAC CTCGCTGCCG ATCATGCTCT TCTCACTCGT TTCCGGCGCG CTTGCCGACA ATTTCGACCG GCGGCGGATC ATGCTCGTCG CTCAGAGCTT CATGCTCGCC GTTTCGGGGC TGCTTACCGT CTGCGCCTAT TACGGTATCG TTACGCCGTG GCTTCTGCTC ATCTTCACCT TTCTTCTCGG CTGCGGCACG GCTTTGAACA ATCCATCCTG GCAGGCTTCG GTCGGCGACA TGGTTCCGCG TGACGACCTG CCGGCAGCGG TGGCGCTGAA CAGCATGGGA TTCAATCTGA CCCGCAGCGT CGGCCCGGCG ATCGGCGGCG CGATCGTCGC AGCCGCGGGT GCCGCGGCCG CCTTCGCCGC CAACACACTC AGCTATTTCG CGATCCTGTT CGCGCTTGCC CGATGGAAAC CGGTAACCCC GGAAAACCGG CTGCCGCGCG AGACGCTCGG GCGCGCCGTT TCCGCGGGCC TGCGCTACGT GGCGATGTCG CCCAATATCG GCAAGGTGCT CGTGCGCGGC TTCGCCTTCG GCCTTTCGGC GAGCGCCATT CTCGCCCTGC TGCCGCTGGT GGCGCGCGAC CTCGTCGGCG GCGGGCCGCT CACTTACGGC GTCATGCTCG GCGCCTTCGG CCTCGGCGCG ATCGGCGGCG CGCTTTTGAG CGCAAGGCTG AGGGAATTCC TCACGAGCGA GGCGATCGTG CGTTATGCCT TTGCCGGCTT CGCCTTCAGC GCATTGGTCA CAGCCATTAG TTCGGAAGCC TGGCTGACCT GTCTCGTGCT CGCTGTTTCC GGCGCGTGCT GGGTGCTGGC GCTTTCGCTC TTCAACACCA CGGTGCAGCT TTCGACACCG CGCTGGGTCG TCGGCCGGGC GCTTTCGCTC TATCAGACGA TGACCTTCGG CGGGATCGCC GGCGGCAGTT GGCTGTGGGG TGTCACCGCC GAACAATACG GCGCAGCCAA CGCGCTCATC GGCTCCTGTC TTCTGATGCT CGTGGGGGCG GCGATCGGAC TGCGCTTCGC CCTGCCGGAG TTCAAGTCGC TCAACCTCGA CCCGCTCAAC CGCTTCAACG AACCGCTGCT CGAACTCGAC CTGAAGCCGC GCAGCGGGCC GATCGTCGTC ATGATCGATT ACGATATCGC CGATAACGAC ATACCCGAAT TCCTGAAGAC CATGGCCGAG CGGCGGCGCA TCCGCATCCG TGACGGGGCC GGGCATTGGG CGCTCATGCG CGACCTCGAA AACCCGACGA CCTGGACCGA GACTTATCAC GTGCCGACCT GGGTCGAATA TGTCCGCCAC AATCAGCGCC GTACCCAGGC CGACGCCGCC ATTGGGGACA AGCTGACCGC ACTCCATCGG GGACCGAACC CGCCGCGGGT GCACCGCATG ATCGAGCGGC AGACGATCGT TCCCGACCAT TACGAGCGCT ACAAGCGATC CGTCGAGATG CACTGA
|
Protein sequence | MKINRKRSEV TNRTSPLAPF RHDIFRTIWI ASLASNFGGL IQAVGAAWLM TSISQSVNMV ALVQASTSLP IMLFSLVSGA LADNFDRRRI MLVAQSFMLA VSGLLTVCAY YGIVTPWLLL IFTFLLGCGT ALNNPSWQAS VGDMVPRDDL PAAVALNSMG FNLTRSVGPA IGGAIVAAAG AAAAFAANTL SYFAILFALA RWKPVTPENR LPRETLGRAV SAGLRYVAMS PNIGKVLVRG FAFGLSASAI LALLPLVARD LVGGGPLTYG VMLGAFGLGA IGGALLSARL REFLTSEAIV RYAFAGFAFS ALVTAISSEA WLTCLVLAVS GACWVLALSL FNTTVQLSTP RWVVGRALSL YQTMTFGGIA GGSWLWGVTA EQYGAANALI GSCLLMLVGA AIGLRFALPE FKSLNLDPLN RFNEPLLELD LKPRSGPIVV MIDYDIADND IPEFLKTMAE RRRIRIRDGA GHWALMRDLE NPTTWTETYH VPTWVEYVRH NQRRTQADAA IGDKLTALHR GPNPPRVHRM IERQTIVPDH YERYKRSVEM H
|
| |