Gene Rleg2_2384 details

Gene Information

Locus tag: Rleg2_2384
Symbol:
ID: 6981123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2445248
End bp: 2446303
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 64%
IMG OID: 643397097
Product: periplasmic solute binding protein
Protein accession: YP_002281885
Protein GI: 209549968
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4531] ABC-type Zn2+ transport system, periplasmic component/surface adhesin
TIGRFAM ID:
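
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends with a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2445248
end_bp = 2446303
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 2446303 - 2445248 + 1 = 1056 bp, and 1056 / 3 = 352 codons, one of which is the stop codon.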


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.691591
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.896834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCA CCCTGGGCCC AGCCCTGAAG ATCCTGGCCT TTAAAATTCC GCTCGCCCTC 
GCCCTACCGG CGCTGGCGGT CCCCGCCTTG CTGTTTGCCG GCACCATGCG GGCCGCCGAC
GCGCCTGTGG TCGTCACCTC GATCAAGCCG ATCCATTCGC TGGTTGCGGC GATCATGCAG
GGTGTGGGCG AACCGGAGCT GATCGTCGAT GGCGCCGCCT CCCCGCATAC TTATAGCCTG
AAGCCGTCGA ATGCGCGCGC GCTGCAGGAA GCCAAGGTGA TCTTCTGGAC CGGCCCCGGC
CTCGAGACTT TCCTGGAAAA ACCGCTGCAG GCGCTGGGCT CGAAGGCCAG CATCGCCGAG
CTCGATCATG CCCCCGGCCT CGTCAAGCTG CCCTTCCGCG AAGGCGGCGC CTTCGAGCCA
CATGAGGATG GCGATGAGCA CCATGGCGCT TCCGCCGAGG GTGAGGATCA CGATCATGCA
GCCGGCACCG GGCATGATGA CCATGATCAC GGACATGACG GTGACCATGA CCATGGCGCC
TTCGACACGC ATCTCTGGCT CGACCCGATG AATGCCAAAG CCATGGCCGC CGTGATCACC
ACGACGCTGG TCGCCGCCGA TCCCGCCAAT GCGCTGACCT ACCAGGCCAA TGCCAAGGCG
CTGGACGACA AGCTGACGGC GCTGGATAAG GAAATCGCCG CCACCGTTGC TCCCGTCAAG
GACAAGCCCT TCATCGTCTT CCACGACGCC TACCAGTACT TCGAGCATCG CTACGGCATC
CGCGTCGCCG GCTCGATCAC CGTCAGCCCG GAAACCATTC CCGGTGCCGA GCGTGTTTCG
GAAATCCACC GCAAAGTCGG CGAACTCGGC GCCACCTGCG TCTTTGCCGA ACCGCAATTC
GAGCCGCGCC TCGTCAATGT CGTCATCGAA GGCACGAAGG CCAGATCCGG CGTGCTCGAC
CCCGAAGCGG CAACGCTGAA GGCCGGCCCC GATCTCTACT TCACCCTCAT GCGCGGCATC
GCCGAGAGCA TGAAGGATTG CCTCTCCAAC GCATGA
 
Protein sequence
MKRTLGPALK ILAFKIPLAL ALPALAVPAL LFAGTMRAAD APVVVTSIKP IHSLVAAIMQ 
GVGEPELIVD GAASPHTYSL KPSNARALQE AKVIFWTGPG LETFLEKPLQ ALGSKASIAE
LDHAPGLVKL PFREGGAFEP HEDGDEHHGA SAEGEDHDHA AGTGHDDHDH GHDGDHDHGA
FDTHLWLDPM NAKAMAAVIT TTLVAADPAN ALTYQANAKA LDDKLTALDK EIAATVAPVK
DKPFIVFHDA YQYFEHRYGI RVAGSITVSP ETIPGAERVS EIHRKVGELG ATCVFAEPQF
EPRLVNVVIE GTKARSGVLD PEAATLKAGP DLYFTLMRGI AESMKDCLSN A
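
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a codon-by-codon translation. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool. The sketch assumes the standard codon assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which this sketch does not handle.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a cleaned, in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = clean(
    "ATGAAACGCA CCCTGGGCCC AGCCCTGAAG ATCCTGGCCT TTAAAATTCC GCTCGCCCTC"
)
last_line = clean("GCCGAGAGCA TGAAGGATTG CCTCTCCAAC GCATGA")

print(translate(first_line))  # MKRTLGPALKILAFKIPLAL
print(translate(last_line))   # AESMKDCLSNA
print(round(gc_fraction(first_line), 2))
```

The two translated fragments match the first 20 and last 11 residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare with the GC content reported in the record.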