Gene Rleg2_5420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5420 
Symbol 
ID6978514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1062581 
End bp1064131 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content63% 
IMG OID643394522 
Productpolar amino acid ABC transporter, inner membrane subunit 
Protein accessionYP_002279340 
Protein GI209547422 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1126] ABC-type polar amino acid transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00342804 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGATCT CGATGAACTG GCTTGAAAAT CTGCGCCGCA GCTTCCTGGA TTGGGACGCC 
ATGGCGGAAG TGCTGCCGAG CATGATCAGC GTCGGCCTCA AGAACACCCT GATCCTGGCC
GCCGCCTCGA CGGTGCTCGG CGTCGTCATC GGCATGGCTC TCGCCGTGAT GGGCATCTCG
CAGTCCCGCT GGCTGCGGCT GCCCGCGCGC ATCTACACCG ATATCTTTCG CGGCTTGCCG
GCGATCGTCA CGATCCTGAT CATCGGTCAG GGGTTTGCCC GGATCGGACG GGAGATCTTC
GGCCCGTCGC CATTCCCGCT CGGCATCCTG GCGCTCAGCC TGATCGCCGG GGCCTATATC
GGCGAAATCT TCCGCTCGGG CATCCAAAGT GTCGAGCGCG GCCAGATGGA AGCCTGCCGG
GCGCTGAGCA TGAGCTACGG ACAGGGCATG CGCCTGATCG TTATTCCGCA AGGCATCCGG
CGCGTTCTGC CGGCGCTGGT CAATCAGTTC ATCGGAAACG TCAAGGATTC CAGCCTCGTC
TATTTCCTCG GATTGCTCGC CTCCGAGCGT GAGATCTTCC GCGTCGGACA GGACCAGGCT
GTCGTAACGG GCAATCTGTC GCCCTTGCTG CTGGCGGGCC TCTTCTATCT CGTCATCACC
GTGCCGCTCA CCCATTTGGT CAACTATATC GACGTGAGAC TGCGCCTGGG AAAACAGGGC
CGCGGTACGG GTGCTGCGAG TGGTCTGGCC GAGGTCAGCG AGTTGCAGGC CGCTGCCGTC
CCGCAGCCCG CGGGCAAATC TTCGGCAGAG ACGAAACCGC GCTTCCAGGC CGGCGCCCTG
AATATCCGGG ATCTCAGCAT GGCCTATGGC GATTTCGACG TGCTGAAGGG CGTCGACCTC
GACATCGCCG CCGGGACCGT CACCTGCATC ATCGGCCCCT CCGGCTCGGG CAAATCGACG
CTTTTGCGCT GCATGAACAG GCTGGTGGAA CCGAAAGGCG GTGACATCCT GCTCGATGGC
AACAGCATTC TGGCCATGAA ACCGGAACGG TTGCGCCGGC GGGTGGGGAT GGTGTTCCAG
CACTTCAACC TCTTTCCCGA TCACACTGCC CTTGAGAACG TCATGCTTTC CCTGACGAAG
ATCAAGAAGA TGCCGAGGCA GGAGGCGCGA CGCATCGCCG AGGCGCGTTT GGCCGAGGTC
GGTCTTGCCG AGCGCAGGGA TCATCGCCCG GCAGGTCTAT CGGGTGGGCA GCAGCAGCGC
GTCGCCATCG CGCGAGCTCT TGCCATGGAT CCGGAGCTTA TCCTGTTCGA CGAAGTGACG
AGTGCGCTCG ATCCCGAACT GGTCAAGGGC GTTCTCGACC TGATGGCGGC GCTTGGCCGC
CAGGGCATGA CCATGGCCGT CGTCACGCAC GAAATGGGAT TTGCGCGCAG GGTCGCCGAT
CAGGTGGTGT TCATGGATGA GGGCCGGATC GTCGAGGCCG GTTGCCCACA GCAGATCTTC
GACAATCCCC AAAGCGAGCG GCTTAAGCGC TTCCTTGCGG AAGTCCTCTA G
 
Protein sequence
MGISMNWLEN LRRSFLDWDA MAEVLPSMIS VGLKNTLILA AASTVLGVVI GMALAVMGIS 
QSRWLRLPAR IYTDIFRGLP AIVTILIIGQ GFARIGREIF GPSPFPLGIL ALSLIAGAYI
GEIFRSGIQS VERGQMEACR ALSMSYGQGM RLIVIPQGIR RVLPALVNQF IGNVKDSSLV
YFLGLLASER EIFRVGQDQA VVTGNLSPLL LAGLFYLVIT VPLTHLVNYI DVRLRLGKQG
RGTGAASGLA EVSELQAAAV PQPAGKSSAE TKPRFQAGAL NIRDLSMAYG DFDVLKGVDL
DIAAGTVTCI IGPSGSGKST LLRCMNRLVE PKGGDILLDG NSILAMKPER LRRRVGMVFQ
HFNLFPDHTA LENVMLSLTK IKKMPRQEAR RIAEARLAEV GLAERRDHRP AGLSGGQQQR
VAIARALAMD PELILFDEVT SALDPELVKG VLDLMAALGR QGMTMAVVTH EMGFARRVAD
QVVFMDEGRI VEAGCPQQIF DNPQSERLKR FLAEVL