Gene Rleg_5706 details

Gene Information

Locus tag: Rleg_5706
Symbol:
ID: 8016669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 288049
End bp: 289488
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 59%
IMG OID: 644827857
Product: permease for cytosine/purines uracil thiamine allantoin
Protein accession: YP_002979057
Protein GI: 241518429
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
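
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(288049, 289488)
print(gene_len, protein_len)  # prints "1440 479"
```

Under these assumptions the result matches the Gene Length (1440 bp) and Protein Length (479 aa) fields of the record.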


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.829822
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.11286
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCAAAAT CCGCCAATTT CACCTCAGAC GAACTGTCGG CATACGACGT CCATGGCATC 
GCGCCGGTTC CAGTGTCGCA CCGAACGTCA TCACCACTCG ACCAATTCTG GATCTGGGCC
GGCGCGAACG TCGCTCCGAT CAACTGGGTT CTGGGAGCGC TCGGAATCCA GATGGGGCTT
AGCCTCTGGG ATACGTTTCT CGTCATTGCG ATCGGCAATC TCGTGGGGGC AGCGCTTTTC
GCCACATTCT GCCTCATGGG ATACCGAACC GGCGTTCCGC AGATGGTTCT CACTCGGCTC
GCCTTCGGTC GCCGCGGCGC CTACCTGCCG ACTTTCGTGC AGCTTCTCAT GGCGATGGGC
TGGGTGGCGA CGAACACCTG GATCGTACTC GATCTTTCGG TCGCTGCGCT CGATCGCATG
GGAATTGGTG GTGGGATGGA GGTGAAATAT GCGATCGCCC TCGTCATCAT GGTGGTCCAG
ATCGGGATAG CCGCCTGGGG CTTCAATGCG ATCAAGTATT TCGAGCGCTA CACGATGCCG
GCGATCCTGC TGATCATGGT CGCGATGACA TTCATGGCGT CTTTTACGGT TGATATCCAA
TGGTCCACCT CAACAGTCAC GGGTATCGCG CGGTGGTCCG CGATGAGCCA GCTGATGACG
GCAATCGGTA TTGGCTGGGG CATATCCTGG CTGGTCTATG CGTCTGACTA CACGCGGTTT
TCAAAACCAG GCTTGAAACC GTCCAGCGTC TTCAAGGCAA CGTTCCTGGG AATGTTCGTG
CCCACGGTCT GGCTCGCCAC GCTCGGCGCG GCGATTGCTT CCGCCGGCGT CGGGTCCGAT
CCGGCCCAGC TCATCATCGC GGCGTTTGGC GTGATGGCCT TGCCGGTCCT GCTCGTGCTG
GTTCACGGGC CGATCGCAAC GAACATCGTC GTCATTTATT CCGCCGCGCT TTCAAGCCTT
GCCATGGATA TCAACAAGCC CCGCTGGGTC GTTTCGCTCG CTTGCGGCGT TGCGGGATCG
ATCATTCTCT ACGGCTTCAT GCAGTCGCAA GATTTCGCGC ATGCGTTCGA AACGTTCATG
GTGACGATGG TCGTCTGGAT CAGCCCCTGG GCTGGCGTAA CCGCTGCCGA TTTCTTCATT
ATGCGCCGCG GGTCGATCAA TGTCGACGAA TTGTACAAGC CGCACACGAC GAGCCGGCTC
GGTGACGTCA ACTGGACCGG CGTCCTGTCA TTGCTGGTCG GCGTCTTCGC TGCCTATCTC
TTCCAGATGA GCGTCGTCGA AGTCCTTCAA GGCCCCCTTG CCCTTGGCCT GGGCGGAATT
GATCTTTCCT GGCTGGCCGG CTTTGTCGTC GCGTTCATGG CCTACATCGT TTCTCACAAG
CTCCGAAGGA CCGCGGATGT CGGCGCGTCC GCACTGACGT TGCCTGTCCG TGGCGAATGA
 
Protein sequence
MSKSANFTSD ELSAYDVHGI APVPVSHRTS SPLDQFWIWA GANVAPINWV LGALGIQMGL 
SLWDTFLVIA IGNLVGAALF ATFCLMGYRT GVPQMVLTRL AFGRRGAYLP TFVQLLMAMG
WVATNTWIVL DLSVAALDRM GIGGGMEVKY AIALVIMVVQ IGIAAWGFNA IKYFERYTMP
AILLIMVAMT FMASFTVDIQ WSTSTVTGIA RWSAMSQLMT AIGIGWGISW LVYASDYTRF
SKPGLKPSSV FKATFLGMFV PTVWLATLGA AIASAGVGSD PAQLIIAAFG VMALPVLLVL
VHGPIATNIV VIYSAALSSL AMDINKPRWV VSLACGVAGS IILYGFMQSQ DFAHAFETFM
VTMVVWISPW AGVTAADFFI MRRGSINVDE LYKPHTTSRL GDVNWTGVLS LLVGVFAAYL
FQMSVVEVLQ GPLALGLGGI DLSWLAGFVV AFMAYIVSHK LRRTADVGAS ALTLPVRGE