Gene Rleg_4721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4721 
Symbol 
ID8007196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp91167 
End bp92228 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID644821654 
ProductABC transporter related 
Protein accessionYP_002972914 
Protein GI241113079 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCATG TTTCGGTCAA CAATGCGCGC AAGGATTACG GCGCGTTCAA AGCCATAAAA 
GGCGTCTCGG TCGATATCGG CGACGGCGAG TTCGTCGTTC TGGTCGGTCC CTCCGGCTGT
GGCAAATCCA CGCTTCTGAG AATGATCGCG GGCCTCGAGG GTATCACCTC GGGGCAGATC
CAGATCGGCA AGCATATCGT CAACGAGCTT GCCCCCAAGG ATCGGGACAT CGCGATGGTG
TTCCAGAATT ATGCGCTCTA TCCGCACATG ACCGTTGCCA AGAACATGGG GTTTTCGTTG
CGGCTGAAAC GAATGCCGCG CACGGAGATC GATCAGCGGG TCGGCAACGC CGCGAAGATC
CTCGGTCTCG AAAGTCTATT GGAGAGATAC CCCAAGCAAC TGTCGGGCGG CCAGAGACAG
CGTGTGGCGA TGGGGCGGGC AATCGTGCGC GACCCGGCCG TCTTCCTCTT CGATGAACCC
CTGTCGAACC TCGACGCCAA GCTCAGGGTG CAGATGCGCT CGGAGATCAA GGAATTGCAT
CAACGGCTGC AGACGACCAC CATCTATGTC ACCCACGACC AGATCGAAGC CATGACCATG
GCCGACAAGA TCGTCGTCAT GAAGGACGGG CTGATCGAGC AGTCGGGTTC TCCGTTGGAA
TTGTACGATC GTCCGAACAA CCTTTTCGTC GCCGGCTTCA TCGGCTCCCC GGCGATGAAT
TTCATCAGCG GCAACATGAC GGAAGATGGG TTTCGAACCG CCGACGGCCT ACTCCTGCCG
AGTGAGCGCC GTCCGGCAGA TGCCGCGATC TACGGCATTC GCCCCGAACA TATCCGGTTG
GACCCAGGCG GCATCGAGGT AACGACGGTG GTCGTCGAGC CCACGGGTTC GGAAACGCTC
GTCATCGTCC GGCTGGGGAC GCAGACGCTG ACCTGTGTCT TCAGGGAACG GATCAGGGCC
GCCCCCGGCG AGGTGCTGAG GATTGCACCA ATCCATGATG CGGTTCACCT GTTTGCCGGA
AACGAGCAGC GGATCACATC AGGCGAAGCC CCGTTGAACT GA
 
Protein sequence
MAHVSVNNAR KDYGAFKAIK GVSVDIGDGE FVVLVGPSGC GKSTLLRMIA GLEGITSGQI 
QIGKHIVNEL APKDRDIAMV FQNYALYPHM TVAKNMGFSL RLKRMPRTEI DQRVGNAAKI
LGLESLLERY PKQLSGGQRQ RVAMGRAIVR DPAVFLFDEP LSNLDAKLRV QMRSEIKELH
QRLQTTTIYV THDQIEAMTM ADKIVVMKDG LIEQSGSPLE LYDRPNNLFV AGFIGSPAMN
FISGNMTEDG FRTADGLLLP SERRPADAAI YGIRPEHIRL DPGGIEVTTV VVEPTGSETL
VIVRLGTQTL TCVFRERIRA APGEVLRIAP IHDAVHLFAG NEQRITSGEA PLN