Gene Rleg2_6455 details


Gene Information

Locus tag: Rleg2_6455
Symbol:
ID: 6983526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011371
Strand:
Start bp: 119442
End bp: 120527
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 52%
IMG OID: 643399452
Product: polysaccharide pyruvyl transferase
Protein accession: YP_002284208
Protein GI: 209552293
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5039] Exopolysaccharide biosynthesis protein
TIGRFAM ID:
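
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 119442, End bp 120527.
length = gene_length_bp(119442, 120527)
print(length, protein_length_aa(length))  # prints "1086 361"
```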


Plasmid Coverage information

Number of covering plasmid clones: 26
Plasmid unclonability p-value: 0.854254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 18
Fosmid unclonability p-value: 0.145413
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCATC TCGGTAAAGT TGTAATACAG CGTCAACAAG TGAAGCTGAT TAAATCGGTA 
CAAAAATTTG TAAAACGAGG CCAACCTTAC ATCATAATGA ACTTCCCTTC GAATAAAGAT
CCAGGCCATA ACGCGAGTTG GCTAGGATTA GCTCATATAT TACGGGAAGT AACAGGGCGT
CTTCCAGTGT TTACCGGTAG CAATGCCGAT AACATCAACC AGATCAAATC CACGCCAGGC
GATGCTCCAA TATTCGTTTG CGGCTGGGGT AACCTTGGAG ATGCACGGAC GGGTCGGGAC
GATATTCTCT ATCGCTTGCT GTGGAAATAT CCCGATCGAA CGATAGTTCA AATGCCGCAA
ACCATCGATT TCGGCAGCGA AGCGCTTGCA GCGTATGCAA AACGTACAAT CGCAAGCCAT
CGCAGATTCT TCCTAATGGC GCGCGATAAT CAGAGCTTTG AGGTAGCAAA GGCCAGCTTC
GATTGTCATG TTGAGGCAGT CCCGGATACG GCATTCGGTA TTGAAAGACT TAAACCATTC
GAAGCTGACC CACTCAGCCT GCTGTATGTC ATGCAGCCCT TTGGAGAGGA CGGCGTTGAT
ATAGGCAGAG CGCGAGCGAT TGCTGACGGC CCTCTTACCG ATTGGAGCAA CGGCCCAAAG
GGCTTGTCCC GCATGCGCCA GTCGAGTCTC GTCAAGGCAG CGCGGCGACG TGGCTTTAGG
CGCGTCGAGA TGGTGGCCCA ACATCATGAG GACGTTGCAG CACGTTATGT CAATTACGGC
GCGAAGATCC TGTCCAGCGC GCAGCGGATA ATCACCAACA GTTTGCACGG GCACATCCTC
TGTCTCTTGC TCAATAAGCC CCACATCGCG GTTGCGAGCA ACGGAAGCAG GCTCCACGAC
TTCATCGGCA GTTGGACTGG TGACAGCCCT CTGGTTGAGA AAGCAACGAG CACTAGTGAG
ATTTTAGCGG CCATGTCTCG TCTTCCATAT GAGTTGAACG GCACCTGGTA CAAGGCGAGC
AGACCGATGG ATCTCAACCC CAGCCTTCCC GCCCCAGATC TGGTGCCTCA TCCCTCCATC
GCCTGA
 
Protein sequence
MYHLGKVVIQ RQQVKLIKSV QKFVKRGQPY IIMNFPSNKD PGHNASWLGL AHILREVTGR 
LPVFTGSNAD NINQIKSTPG DAPIFVCGWG NLGDARTGRD DILYRLLWKY PDRTIVQMPQ
TIDFGSEALA AYAKRTIASH RRFFLMARDN QSFEVAKASF DCHVEAVPDT AFGIERLKPF
EADPLSLLYV MQPFGEDGVD IGRARAIADG PLTDWSNGPK GLSRMRQSSL VKAARRRGFR
RVEMVAQHHE DVAARYVNYG AKILSSAQRI ITNSLHGHIL CLLLNKPHIA VASNGSRLHD
FIGSWTGDSP LVEKATSTSE ILAAMSRLPY ELNGTWYKAS RPMDLNPSLP APDLVPHPSI
A
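
As a quick cross-check between the two sequences above, the sketch below translates the first and last rows of the gene sequence and compares them with the start and end of the protein sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in permitted start codons, not in amino acid assignments). The helper names `clean` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence as displayed in the record.
first_row = "ATGTATCATC TCGGTAAAGT TGTAATACAG CGTCAACAAG TGAAGCTGAT TAAATCGGTA"
last_row = "GCCTGA"

print(translate(first_row))  # MYHLGKVVIQRQQVKLIKSV
print(translate(last_row))   # A*
```

The first 20 residues match the opening of the protein sequence, and the final codons give the terminal alanine followed by a TGA stop.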