Gene Rleg2_3042 details

Gene Information

Locus tag: Rleg2_3042
Symbol:
ID: 6981787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3101065
End bp: 3102192
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 63%
IMG OID: 643397752
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_002282535
Protein GI: 209550618
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0600] ABC-type nitrate/sulfonate/bicarbonate transport system, permease component
TIGRFAM ID:
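
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3101065
end_bp = 3102192
reported_gene_length = 1128      # bp
reported_protein_length = 375    # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.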


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACATC TGACCTTCTC CTGGCAGGGC GTGACCGCCC TTCTCTTCTG CATGGCAGCA 
CTTGTCACCC TGCCGCTTCT GGCCGAAGGT GCGGCGCAGC CCTTCACCGA CGGCACGGCA
AGCCTCGTTT TCACCATCGT CGTTGCGGCT GCCCTGCTAT CTTTTGCGCC GCAGCCGCCG
GCCTATCGCG CCACCGTGCT GTTCATCGGC GCGCATGGCG CCGCCTGGAT GCTGCTTTCC
GCTCTCTCCG GCAATGAGGG CATGGCAACG CGGGCTTATT TCCTGCTGCT GTTTTCCTGC
TGGCTGCTTG CCTGGCGATG CGTCACCGAG CTGTCGAAAC TGCAGCCCGT CACTGCTTTC
GGCAAATCTG CGCTCCAACT GCTGATCCCG GCGATCTTCG GCGCCTGGAT CCTCATCCTC
TGGGAAGCCG CGACGCGCGG CGCCGGCATT CCCTTCATCA TCCTGCCGCC GCCGAGCGCC
ATCGGTGCCC GTATCATGGC CTCCCTTCCC ATCCTCGGTG CCGACGTCAG GCAGACGATC
TTCAAGGCGG TGCTGATCGG TTATGTCGTC GGCTGCCTTA GCGGCTTTGC CGTCGCGGTG
CTGGCCGACC GCATCACCTT CCTGCGGCGC GGTCTCCTGC CGATCGGCAA CATGGTGTCG
GCCCTGCCGA TCATCGGCGT CGCGCCGGTA ATGGTCATGT GGTTCGGCTT CGACTGGCCG
TCGAAAGCCG CCGTCGTCAT CATCATGACC TTCTTCCCGA TGCTGGTGAA TACCGTCGCC
GGCCTTGCCG CCTCCGGCAG CATGGAGCGC GACCTGATGC GCACCTACGC CTCGAGCGAC
TGGCAGACAC TGCTCAAGCT CAAGCTTCCG GCCGCCATGC CCTTCATTTT CAACGCACTG
AAGATCAACT CGACGCTGGC GCTGATTGGT GCCATCGTTG CCGAATTCTT CGGGACGCCG
ATCGTCGGCA TGGGCTTCCG CATCTCCACC GAGATCGGCC GCATGAATGT CGACATGGTT
TGGGCGGAAA TCGCCATCGC GGCGCTGGCC GGATCGATCT TTTATGGCAT CATCGCCCTG
AGCGAACGGG CGGTGACGTT TTGGCATCCG TCTATCCGTG GTGGCTAG
 
Protein sequence
MRHLTFSWQG VTALLFCMAA LVTLPLLAEG AAQPFTDGTA SLVFTIVVAA ALLSFAPQPP 
AYRATVLFIG AHGAAWMLLS ALSGNEGMAT RAYFLLLFSC WLLAWRCVTE LSKLQPVTAF
GKSALQLLIP AIFGAWILIL WEAATRGAGI PFIILPPPSA IGARIMASLP ILGADVRQTI
FKAVLIGYVV GCLSGFAVAV LADRITFLRR GLLPIGNMVS ALPIIGVAPV MVMWFGFDWP
SKAAVVIIMT FFPMLVNTVA GLAASGSMER DLMRTYASSD WQTLLKLKLP AAMPFIFNAL
KINSTLALIG AIVAEFFGTP IVGMGFRIST EIGRMNVDMV WAEIAIAALA GSIFYGIIAL
SERAVTFWHP SIRGG
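
The relationship between the two sequences can be spot-checked by translating parts of the gene sequence. The following is a minimal sketch, not the database's own method: the `translate` helper and `CODON_TABLE` are hypothetical names, and the codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs mainly in the start codons it permits). Only the first 30 bases and the last line of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the final line of the gene sequence, copied from the record.
first_block = translate("ATGAGACATC TGACCTTCTC CTGGCAGGGC")
last_line = translate("AGCGAACGGG CGGTGACGTT TTGGCATCCG TCTATCCGTG GTGGCTAG")

print(first_block)
print(last_line)
```

The two results can be compared against the first ten residues and the last line of the protein sequence listed in this record.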