Gene Rleg2_5905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5905 
Symbol 
ID6977292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp321984 
End bp323123 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content64% 
IMG OID643393358 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002278176 
Protein GI209546286 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.648846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCTT TCGACTCCTC GCCGCCACCG CCGACGACGA CCGAAACCTC GGTCAGCAAG 
CCATCACGCG GGCATGAGAG CTATATTGCC CTGGTCTGGC GCCGGCTGCG GCGCTCCTGG
ACCGGCATGG CCGGCCTCAT CCTCGTCGGC CTGCTGCTCT TCATGGCGAT CTTTGCCGAT
TTCGTCGCGC CGATGGACCC GAAGGCCACC GATGTCGGTT TCGCGCCGCC GCAGATGATG
AGCTTCCACG ACAAGGACGG CAATTTCGTC TTCCAGCCGC GCGTCTATGG CCTGGCCGAT
TCCGATGCGC TCGACCCGGT CACCTTCCAG CCGATCGTCG GCGTCGATTA CGACAATCCG
CGGCTGCTCG GCTTCTTCGT CAAGGGATCG GAATACCGGC TTTTCGGCCT GATCCCGGCC
GACCGCCACT TCTTCGGCTC GACCGACGGC CAGCCGGTGC ATTTCCTCGG CACCGACAAG
TTCGGCCGCG ACGTGCTGTC GCGCGCCATC ATCGGCTCGC GCATCTCGCT GATGATCGCG
CTGACCGTCG TCTTCATCGT CACCGTCATC GGCACGACGG TCGGCATGGT CTCCGGCTAT
TTCGGCGGCA CTTTCGATGT CTGGCTGCAG CGTTTCGTCG AGCTGGTGCT GGCCTTCCCG
CAGCTGCCGC TCTATCTGGC GCTGACGTCG CTGATCCCGG TGACGGCGCC GACCAACGTC
TTCCTCGCCT TCGTCATCAT CGTCATGTCG GCGCTCGGCT GGGCGCAGAT GTCGCGCGAG
GTGCGCGGCA AGACCTTGGC GCTTGCCCGC ATCGACTATG TCCGGGCGGC AATGGCGGTC
GGCGCCACCG ACAAGCGCAT CATCATGCAG CATATCTTCC CGAACGTGAT GAGCCACGTC
ATCGTCGCGG TGACGCTCGC CATACCGAGC GTCGTGCTGC TCGAATCCTT CCTCGGTTTC
CTCGGCTTTG CCGTCAAGCC GCCGCTGATT TCCTGGGGGC TGATGCTGCA GGATACGGCG
ACCTATTCGG TCATCGGCTC CTATCCCTGG ATTCTCTCTC CCGTCGGCTT CGTGCTCGTC
ACCGTCTTCG CCTTCAATGC GCTGGGCGAT GGACTGCGCG ACGCGGTCGA TCCTTATTGA
 
Protein sequence
MLAFDSSPPP PTTTETSVSK PSRGHESYIA LVWRRLRRSW TGMAGLILVG LLLFMAIFAD 
FVAPMDPKAT DVGFAPPQMM SFHDKDGNFV FQPRVYGLAD SDALDPVTFQ PIVGVDYDNP
RLLGFFVKGS EYRLFGLIPA DRHFFGSTDG QPVHFLGTDK FGRDVLSRAI IGSRISLMIA
LTVVFIVTVI GTTVGMVSGY FGGTFDVWLQ RFVELVLAFP QLPLYLALTS LIPVTAPTNV
FLAFVIIVMS ALGWAQMSRE VRGKTLALAR IDYVRAAMAV GATDKRIIMQ HIFPNVMSHV
IVAVTLAIPS VVLLESFLGF LGFAVKPPLI SWGLMLQDTA TYSVIGSYPW ILSPVGFVLV
TVFAFNALGD GLRDAVDPY