Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5371 |
Symbol | |
ID | 6978465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1003521 |
End bp | 1004975 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394473 |
Product | permease for cytosine/purines uracil thiamine allantoin |
Protein accession | YP_002279291 |
Protein GI | 209547373 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.583261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.663394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTC GAAATCCGTC TCCCTCGTTA TACAACGAGG ATCTTGCACC CGCCGAGGAG CGCAAATGGG GTGCATTCAG TATCTTTAAC GTCTGGACAT CCGACGTCCA CAGCCTGTGG GGCTATTATC TGGCGGCGAG CCTGTTCCTG CTGTGCGGCA GCTTCGTGAA TTTCGTCATC GCCATTGGCA TCGGCTCCCT GGTCATCTTC CTTCTGATGA GCATGGTCGG CAATGCGGGC GTGCGCACCG GCGTACCCTT TCCAGTTCTG GCGCGCGCCT CCTTCGGCAC GTTCGGCGCC AACGTTCCGG CCCTGGTCCG GGCGGTGGTC GCCTGCTTCT GGTACGGCGC GCAGACCGCC GCCGCATCGG GTGCGATCGT CGCTCTGCTT ATCCGCAACG AGAGCCTGCT CGCCTTCCAC CAGAATAGCC ATATGCTCGG CCACTCCACC CTCGAACTCA TCTGCTACGT CATCGTCTGG GCTCTGCAGC TGCTGATCAT CCAGCGGGGC ATGGAAACGG TTCGCAAGTT CCAGGATTGG GCCGGTCCGG CCGTCTGGAT CATGATGCTG ATCCTGGCCG TCTATCTGGT CGTCAAATCC GGCACCTTCT CCTTCGGTTC GGAAATCCCG CGCGATGTGC TGATCGAGAA GACCAAGGAT GCCGGCGTGC CGGGCGAGCC CGGCTCGTTT GCAGCACTTG CCGCAGTGGC CGCCACCTGG ATCACCTATT TCGCAGCGCT TTACCTGAAC TTCTGCGATT TCTCGCGTTA CGCGACAAGC GAAAAGGCGC TGCGCAAGGG CAATCTCTGG GGTCTGCCGA TCAACCTACT GGCCTTCTGC CTCGTCGCAG GCGTCACGAC CACGGCCGCC TTCACCGTAT ATGGCGAGGT CCTGCTGCAT CCGGAAATGA TATCGGCGAA ATTCGAAAGC TGGTTCCTGG CGCTGCTTGC GGCACTGACA TTCGCGATCG CCACGCTCGG CATCAACGTC GTGGCGAACT TCGTCTCGCC GGCCTTCGAC TTCGCCAATG TCTTCCCGCG CCAGATCAAC TTCAAACGCG GCGGGTACAT CGCCGCATTG ATCGCCCTCG TGCTCTATCC GTTTGCTCCC TGGGAGACCG GTGCGGCGCA TTTCGTCAAC TTCATCGGGT CGACCATGGG GCCGATCTTC GGCGTCATGA TGGTGGACTA CTATCTCATC CGGAAGAGCC AGCTGAACGT CGAGGCGCTC TACCATGAGA ATGGCGAGTT CCGCTTCCAG AACGGCTGGC ACGGCAATGC CTTCATCGCA TTTGCGGTCG GCGCGCTGTT CTCCTCGATC CTGCCGACCT TCACCAGCAT TCTGCCGAAT TGGTGGGGCA CCTATGGCTG GTTCTTCGGC GTCGGGATCG GCGGGGCGAT CTATTTCGTG CTGAGAGTGG GCGCCCGGCG CAATCCGGCC TTCGCCGCAC GATAA
|
Protein sequence | MSIRNPSPSL YNEDLAPAEE RKWGAFSIFN VWTSDVHSLW GYYLAASLFL LCGSFVNFVI AIGIGSLVIF LLMSMVGNAG VRTGVPFPVL ARASFGTFGA NVPALVRAVV ACFWYGAQTA AASGAIVALL IRNESLLAFH QNSHMLGHST LELICYVIVW ALQLLIIQRG METVRKFQDW AGPAVWIMML ILAVYLVVKS GTFSFGSEIP RDVLIEKTKD AGVPGEPGSF AALAAVAATW ITYFAALYLN FCDFSRYATS EKALRKGNLW GLPINLLAFC LVAGVTTTAA FTVYGEVLLH PEMISAKFES WFLALLAALT FAIATLGINV VANFVSPAFD FANVFPRQIN FKRGGYIAAL IALVLYPFAP WETGAAHFVN FIGSTMGPIF GVMMVDYYLI RKSQLNVEAL YHENGEFRFQ NGWHGNAFIA FAVGALFSSI LPTFTSILPN WWGTYGWFFG VGIGGAIYFV LRVGARRNPA FAAR
|
| |