Gene Rleg2_6146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6146 
Symbol 
ID6983219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp81428 
End bp82414 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content57% 
IMG OID643399164 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_002283920 
Protein GI209552004 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.485636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCG TATGGAAAGC TGTAGTTGCA GCAGCATGCC TCACCTTTTC AGTGTACGGC 
CAAGCTGGGG CAGCGGAGTT GCCCGGGCAG GGTGTCACCG TTCGCCCCAT CAAGGGTACG
CCCGCGAACA CCTGGTTCCA GCACCTCATA GTCCAGATGG GGCTTGAGAA GCTAGGCTAC
ACGGTTGCGG ACACTCAGGA AGCCGATTTT CCGGCCGTCC ATCTCGCTGT CGGCGCCGGC
GATGCGGACT ACACCGCCGG AAACTGGACC CCGCTGCATG ACGCCTTCTA CGAAAAGTCC
GGCGGTGATT CCACGATGAC CCGGGTTCAC GCGATCATCA CCGGAACAAC GGAGGGCTAC
TACGTCGACA AGAAGACCGC AGAAGCCAAT CATCTCACGA ACATCGACCA GTTGAAGACG
CCAGAGATCG CCAAGCTCTT TGATACAGAT GGCGATGGCA AGGCAAATCT CGCTGGATGC
AATCCGGGAT GGGGATGTGA AGCATCGATC GAGAAGCATC TCGACACCTT CCATCTGCGT
GGTAGCGTGC AGCACGACCA GGGCAGCTAT TTCGCCATCA TGGCCGACGT GATCTCCCGC
TATAAACAGG GCAGCCCTAT TCTCTACTAT ACCTGGACGC CGAACTGGAT CGCCGACGCC
CTCATCGAGG GAAGGGATGT CGTCCGTCTT CAAGTTCCCT CGACTCCGGG GAGCAAGACG
AAGAGCGCCG ACGGTACCGA CTACGGCTTC ATTTCCAGTG ACGTCTACAT TGTCGCCAAC
AACGAGTTTC TCTCGAAGAA TCCCGTTGCA AAGAAGTTTT TCGAGATGGT CAATATTCCC
ATCGCCGACG TGAACGCGGC TCAGATTCCC CTCAGCAAGG GCAATGTGAA GGTCGAAGAC
GTCCGGAAGC AGGCAGAGGC GTGGGTCGCA GCGCATCAGA CCGATTTCGA TAACTGGGTC
GACCAGGCGA AGAAAGCGGC GGAATAG
 
Protein sequence
MKVVWKAVVA AACLTFSVYG QAGAAELPGQ GVTVRPIKGT PANTWFQHLI VQMGLEKLGY 
TVADTQEADF PAVHLAVGAG DADYTAGNWT PLHDAFYEKS GGDSTMTRVH AIITGTTEGY
YVDKKTAEAN HLTNIDQLKT PEIAKLFDTD GDGKANLAGC NPGWGCEASI EKHLDTFHLR
GSVQHDQGSY FAIMADVISR YKQGSPILYY TWTPNWIADA LIEGRDVVRL QVPSTPGSKT
KSADGTDYGF ISSDVYIVAN NEFLSKNPVA KKFFEMVNIP IADVNAAQIP LSKGNVKVED
VRKQAEAWVA AHQTDFDNWV DQAKKAAE