Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6839 |
Symbol | |
ID | 8022422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 284287 |
End bp | 285513 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644833705 |
Product | protein of unknown function DUF1228 |
Protein accession | YP_002984839 |
Protein GI | 241666755 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0480421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.134894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAAC ACAGCCCCAG CCCATATGTG GTCGTGCTCG CGGGATCGGC GTCCCTCGCC ATCGCCATGG GCGTCGGTCG GTTCGCATTC ACTCCAATTC TGCCGATGAT GCTACACGAC GGCGTCGTTG ATCTTTCGCG AGCCGGAGGA CTCGCGACTG CCAACTACGT CGGCTACCTG GTCGGTGCAT TGGCAGCGAT GGCAGTACCC AAGAGGTGGG ACCATACCTT CGTCATACGC CTGACACTGG TGGCCACGGT TCTGCTTACG GCTCTGATGT CAGTGCCCTA TGCCGAGGCC TGGGTCGCTC TTCGGTTCTT GGCGGGCGTC GCTTCGGCGA TCGGATTCGT CTTCACCTCC GGTTGGTGTC TCGCCCAGCT TTCGGGAACC GGCAGCTCGA TCGGAAGCGC CATATTCACG GGACCCGGCG CAGGCATCGC CGTGTCCGGG CTGGCCGCCA GCGGGATGAC CATTCTCGGG CTGTCCGGCC ACACCGCCTG GCTGATATTC GCCGCGATCT CCGCCACGAT CAGCGGGATC ATCTGGAAGA CCTTCGGCGA GAGCGCGAAG CCATCCGACG CCTATTCGGT GGGGGCGCCG ACGCGCGCCT CCGGCAAGGT GCCGAAATCG GAAATGCCTC TATTTGCGAT CGCATACGGG CTGGCCGGCT TCGGCTACAT CGTCACGGCG ACCTATCTTC CGGTGATCGC GAAGAACAGC ATTCCTGGTT CACCCTTGCT CACCGTCTTC TGGCCGCTTT TCGGGGTCGC GGCAGTCGTC GGATCGCTGC TGGCGGCGCG CGTTCCGCAT AGCGCCGACG TGCGGCTCCA TCTGATTGCC GCATACCTCG TGCAGGCGGT CGGGGTGGGG CTGTCGGTTA TCTGGCAGGA CGCTTTCGGC CTTGCACTCA GCAGCGTTCT CGTAGGCCTG CCGTTCACTG CGATCAGCTT CTTCGCCATG AACGAAGTTA GGCGGATCAG ATCGAGCCAC CACGCGCGTT ACATGGGACT GCTGACGGCG GTGTTCGCGA TCGGACAGAT CATGGGGCCG CCTGCCGTAG GAGTGATCAT GAGGCATGTG GTGAACGTAG ACGCCGGGTT CGATCTTGCG CTTGCCGTCG CCAGCATCGC GCTCGTAGTC GGCGCCGCGA TCTATGTTGC GATGATCCTG CTGTTTCCAA GCGAGCGGAA TGCGAGGACC GCAGGGCGCT CGGTCCGTCC CACTTGA
|
Protein sequence | MSKHSPSPYV VVLAGSASLA IAMGVGRFAF TPILPMMLHD GVVDLSRAGG LATANYVGYL VGALAAMAVP KRWDHTFVIR LTLVATVLLT ALMSVPYAEA WVALRFLAGV ASAIGFVFTS GWCLAQLSGT GSSIGSAIFT GPGAGIAVSG LAASGMTILG LSGHTAWLIF AAISATISGI IWKTFGESAK PSDAYSVGAP TRASGKVPKS EMPLFAIAYG LAGFGYIVTA TYLPVIAKNS IPGSPLLTVF WPLFGVAAVV GSLLAARVPH SADVRLHLIA AYLVQAVGVG LSVIWQDAFG LALSSVLVGL PFTAISFFAM NEVRRIRSSH HARYMGLLTA VFAIGQIMGP PAVGVIMRHV VNVDAGFDLA LAVASIALVV GAAIYVAMIL LFPSERNART AGRSVRPT
|
| |