Gene Information |
Locus tag | Smed_4373 |
Symbol | |
ID | 5317982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 874380 |
End bp | 875720 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776177 |
Product | general substrate transporter |
Protein accession | YP_001313110 |
Protein GI | 150376514 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
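The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(874380, 875720)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Under those assumptions the results agree with the Gene Length (1341 bp) and Protein Length (446 aa) rows.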
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGTG CAATTCAACA ATCCCTGGGA CCGTCGTCCT CCTCGATGGA GCGCGATGCG CGGCGCATTC ACGACGACAG GCCGGTATCT GCAGGCAACA TAGCCATTGG CGTCGTGATC GGCCGCGCAG CGGAGTTCTT CGATTTCTTC GTTTTTGGCA TCGCCTCCGT CGTCGTTTTT CCGCAGCTCT TCTTTCCCTT CGCGCCCGAC CGGCTGACCG CTACGCTCTA TTCCTTCGCC ATCTTTGCAC TCGCCTTCAT GGCAAGGCCG GTGGGTTCGC TGGTCTTCAT GGCCATCGAC CGGACCTATG GCCGAGGCGT GAAACTAACG ATTGCCCTTT TCCTCCTCGG CGGATCGACC GCCTCCATCG CCTTTCTGCC CGGCTACGCA ACGCTCGGCG GCTGGTCCAT CCTGCTGCTT GCCGTCTTCC GTCTCGGTCA GGGATTTGCG CTCGGCGGGG CCTGGGATGG GCTCGCCTCG CTTCTCGCCC TCAACGCGCC GCAGCATCAC CGCGGCTGGT ATGCGATGAT CCCACAGCTA GGGGCGCCGC TCGGCTTCAT GCTGGCGAGT GCGCTGTTTG CCTACTTCCT CGCTTCGCTC TCGCAAGCGG ACTTTCTTGC CTGGGGCTGG CGCTATCCGT TCTTCGTCGC CTTTGCCATC AACGTCGTTG CATTATTCAC GCGCCTGCGG CTCGTCATGA CGAAGGAATT CGGAACCCTG CTCGATCTGC ACGAGCTTCA GGCAAGCCGG GTCACGGAGG TGCTCAGAGT AAATGGTGGG CACGTTCTGG TCGGTGCGTT CGTGCCGCTT GCAAGCTTCG CTCTCTTCCA TCTGGTGACA GTGTTCCCGC TGAGCTGGGT GGAAATATAC ACGGATCACG GCGCAAGCTC CTTCCTGATG GTACAGTTCC TTGGCGCGGT GTGCGGGATC GTCGCGATCG TTGCTTCCGG GCTCATTGCC GATCGCATCG GCCGGCGCAA TCATCTTGGT ATGTGCGCGG TGCTGATCGC GATTTTCAGC TTTGTAGCCC CAATGCTGCT TCAGTCGGGC GCGACCGGTC GCGAAATGTT CGTGATTGTC GGCTTCACAA TCCTCGGCCT CTCCTTCGGG CAGGCCGCCG GGGCGGTCGC GTCGCGCTTC GGCAGGGCCT ACCGCTATAC GGGGGCGGCG CTGACCTCGG ATCTGTCGTG GCTGATAGGG GCCGGGTTTG CACCGCTGGT GGCACTTGGG CTCACCAGTC GCTTCGGCCT CGCTTTCGCG GGATATTACC TTCTCTCGGG CGCGCTTTGC ACGATACTGG CGCTTATCTT CAGCAAGACG CTCGAGATCA ACAACGAATA G
|
Protein sequence | MNGAIQQSLG PSSSSMERDA RRIHDDRPVS AGNIAIGVVI GRAAEFFDFF VFGIASVVVF PQLFFPFAPD RLTATLYSFA IFALAFMARP VGSLVFMAID RTYGRGVKLT IALFLLGGST ASIAFLPGYA TLGGWSILLL AVFRLGQGFA LGGAWDGLAS LLALNAPQHH RGWYAMIPQL GAPLGFMLAS ALFAYFLASL SQADFLAWGW RYPFFVAFAI NVVALFTRLR LVMTKEFGTL LDLHELQASR VTEVLRVNGG HVLVGAFVPL ASFALFHLVT VFPLSWVEIY TDHGASSFLM VQFLGAVCGI VAIVASGLIA DRIGRRNHLG MCAVLIAIFS FVAPMLLQSG ATGREMFVIV GFTILGLSFG QAAGAVASRF GRAYRYTGAA LTSDLSWLIG AGFAPLVALG LTSRFGLAFA GYYLLSGALC TILALIFSKT LEINNE
|
| |
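The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates a nucleotide string using the amino acid assignments shared by NCBI translation tables 1 and 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the translation ignores the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG.

```python
# Codon-to-residue map built from the NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First two blocks and last two blocks of the gene sequence in this record.
gene_start = "ATGAATGGTG CAATTCAACA"
gene_end = "ACAACGAATA G"

print(translate(gene_start))  # MNGAIQ
print(clean(gene_end)[-3:])   # TAG
```

The translated prefix matches the first six residues of the protein sequence row (MNGAIQ), and the gene ends with the stop codon TAG. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.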