Gene Rleg_5981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5981 
Symbol 
ID8016339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp10270 
End bp11340 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content59% 
IMG OID644827293 
ProductABC transporter related 
Protein accessionYP_002978493 
Protein GI241258609 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGTA TCACCCTCGA TCATATTCGA CATGCCTATG GGCCGAACCC GAAGAGCGAG 
AAGGACTACG CTCTCAAGGA AGTGCACCAC GAGTGGAACG ATGGCGGTGC CTATGCGCTG
CTCGGACCTT CAGGCTGCGG AAAGACCTCG CTGCTCAATA TCATTTCCGG CCTTATTCAG
CCCTCCGAAG GACGAATTCT CTTCGACGGA CAGGATGTTA CGAACCTGCC GACGCAGCAG
CGAAATATTG CGCAGGTATT CCAGTTTCCG GTCATCTACG ACACGATGAC CGTCTATGAC
AATCTGGCCT TCCCCTTGCG CAACCGCGGA GTCGCGGAGC CTGATGTTGA TCGTCGTGTC
CGCGAAATAT TGGAGATGAT TGATCTTGCA GATTGGGCCA AGCGTCGCGC GCGCGGTTTG
ACGGCGGACC AAAAGCAGAA GATTTCGCTC GGCCGCGGCC TGGTGCGCTC GGATGTGAAC
GCGATTCTCT TTGACGAGCC GCTCACTGTT ATCGATCCGC ATATGAAATG GGTGCTGCGA
TCGCAGCTGA AGCGGCTGCA TAAGCAGTTC GGTTTTACAA TGGTCTATGT CACGCATGAC
CAGACGGAGG CGCTGACCTT CGCCGACAAA GTCGTGGTGA TGTACGATGG CGAGATCGTG
CAGATCGGCA CGCCGGCCGA GCTCTTCGAG CGTCCGAGTC ATACCTTCGT CGGCTACTTC
ATCGGTTCTC CGGGCATGAA CTTCATGCCA GCCAAGGTGG AAGGCCGCAC GGTTCGGGTC
GGCGAGCATG CGCTGACGCT CGACTATGCG CCAAAGATTT CGGCAGCGGC CAAGGTAGAG
CTTGGAATCC GGCCCGAGTT TGTTCGGGTC GGCCGCGAGG GCATGCCTGT GACCGTCAGC
AAGGTGGAAG ATATCGGCCG GCAGAAGATC GTCCGCGCGC AGTTTGCCGG CCAGCCGATC
GCGATAGTCG TCCCTGAGGA CGAGGACATT CCGGCTGATC CGCGGGTGAC CTTCGAGCCA
TCGGGTATCA GTATCTATGC CGACTCTTGG CGCGCCGGAC CGGAGGCTTG A
 
Protein sequence
MARITLDHIR HAYGPNPKSE KDYALKEVHH EWNDGGAYAL LGPSGCGKTS LLNIISGLIQ 
PSEGRILFDG QDVTNLPTQQ RNIAQVFQFP VIYDTMTVYD NLAFPLRNRG VAEPDVDRRV
REILEMIDLA DWAKRRARGL TADQKQKISL GRGLVRSDVN AILFDEPLTV IDPHMKWVLR
SQLKRLHKQF GFTMVYVTHD QTEALTFADK VVVMYDGEIV QIGTPAELFE RPSHTFVGYF
IGSPGMNFMP AKVEGRTVRV GEHALTLDYA PKISAAAKVE LGIRPEFVRV GREGMPVTVS
KVEDIGRQKI VRAQFAGQPI AIVVPEDEDI PADPRVTFEP SGISIYADSW RAGPEA