Gene Rleg_2226 details


Gene Information

Locus tag: Rleg_2226
Symbol:
ID: 8013233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2230530
End bp: 2231903
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 63%
IMG OID: 644824812
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002976042
Protein GI: 241204946
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
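
The start, end, gene length, and protein length fields above are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the convention under which 2231903 - 2230530 + 1 gives the listed 1374 bp) and that the last codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2230530, 2231903)
print(length_bp)                  # 1374
print(protein_length(length_bp))  # 457
```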


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.435379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.755541
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGACA ATTGGACCCC GAGCAGCTGG CGGCAAAAAC CGATCCTGCA GGTTCCCGAA 
TATCCCGACG CAGCCGCTTT GGCGGCAACG GAAGCCACGC TCGCCAGCTA TCCGCCGCTT
GTCTTTGCAG GTGAGGCGCG CCGCCTGAAG AAACATCTCG CCAATGTCGC CGAAGGCAAC
GGCTTCCTGC TGCAGGGCGG CGACTGCGCC GAGAGCTTCG CCGAACACGG TGCCGACAAT
ATCCGCGACT TCTTCCGCGC CTTCCTGCAG ATGGCCGTGG TGCTGACCTT CGGGGCTCAG
CTGCCGGTCG TCAAGGTCGG CCGCATCGCC GGCCAGTTCG CCAAGCCGCG GTCGTCGAAT
GTCGAGAAGC AGGGCGATGT GACGCTGCCG GCCTATCGCG GCGACATCAT CAACGGTATC
GAGTTCACCG AGGAATCGCG CATTCCGAAC CCGGAACGCC AGGCCATGGC CTATCGCCAG
TCGGCTGCGA CGCTGAACCT ATTGCGTGCC TTTGCGATGG GCGGTTATGC CAATCTCGAA
AACGTGCATC AGTGGATGCT CGGCTTCGTC AAGGACAGCC CGCAGGGCGA GCGCTACCGC
AAGCTTGCCG ACCGCATCAG CGAAACCATG GATTTCATGA AGGCGATCGG TATCACCTCG
GAAAACCAGC CGGCGCTGCG CGAGACCGAT TTCTTCACCA GTCACGAGGC GCTGCTGCTC
GGCTACGAGG AAGCGCTGAC CCGCGTCGAT TCCACCTCGG GCGACTGGTA TGCAACCTCT
GGCCACATGA TCTGGATCGG CGACCGCACG CGCCAGGCCG ACCATGCGCA TGTCGAATAT
TGCCGCGGCA TCAAGAACCC GATCGGCCTG AAATGCGGCC CTTCGCTGCA GGCCGACGAC
CTGCTGCAAC TGATTGACAT CCTGAATCCT GCCAACGAGG CCGGGCGCCT GACGCTGATC
TGCCGCTTCG GCCATGAGAA GGTCGCCGAC AGCCTGCCGA AACTCATTCG CGCCGTTGAG
CGCGAGGGCC GCAAGGTCGT CTGGTCCTGC GATCCGATGC ACGGCAACAC GATCACGCTC
AATAACTACA AGACCCGTCC CTTCGAGCGG ATCTTGTCGG AAGTCGAAAG CTTCTTCCAG
ATCCACCGCG CCGAAGGCTC GCATCCGGGT GGCATCCATA TCGAGATGAC TGGCAAGGAC
GTGACCGAGT GCACCGGCGG CGCCCGCGCG GTCTCCGCCG AAGACCTGCA GGACCGCTAT
CACACCCATT GCGATCCGCG CCTCAACGCC GACCAGGCGC TCGAGCTGGC CTTCCTGCTT
GCCGAGCGCA TGAAGGGTGG TCGCGACGAA AAGCGCATGG TCGCCAACGG CTGA
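
The gene sequence is displayed in space-separated blocks of ten bases, so the spacing has to be stripped before computing properties such as length or GC content. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first displayed line of the sequence is used as input, so the printed percentage describes that fragment rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display, then uppercase."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence only; paste the full block
# to evaluate the whole gene.
first_line = "ATGGCAGACA ATTGGACCCC GAGCAGCTGG CGGCAAAAAC CGATCCTGCA GGTTCCCGAA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_content(seq), 1))
```

Applied to the complete sequence, the same two calls should report a length matching the listed gene length and a GC percentage comparable to the listed GC content.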
 
Protein sequence
MADNWTPSSW RQKPILQVPE YPDAAALAAT EATLASYPPL VFAGEARRLK KHLANVAEGN 
GFLLQGGDCA ESFAEHGADN IRDFFRAFLQ MAVVLTFGAQ LPVVKVGRIA GQFAKPRSSN
VEKQGDVTLP AYRGDIINGI EFTEESRIPN PERQAMAYRQ SAATLNLLRA FAMGGYANLE
NVHQWMLGFV KDSPQGERYR KLADRISETM DFMKAIGITS ENQPALRETD FFTSHEALLL
GYEEALTRVD STSGDWYATS GHMIWIGDRT RQADHAHVEY CRGIKNPIGL KCGPSLQADD
LLQLIDILNP ANEAGRLTLI CRFGHEKVAD SLPKLIRAVE REGRKVVWSC DPMHGNTITL
NNYKTRPFER ILSEVESFFQ IHRAEGSHPG GIHIEMTGKD VTECTGGARA VSAEDLQDRY
HTHCDPRLNA DQALELAFLL AERMKGGRDE KRMVANG
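
The protein sequence corresponds to the gene sequence translated with translation table 11, whose amino-acid assignments match the standard genetic code. A minimal Python sketch of that translation follows. It assumes an unambiguous A/C/G/T sequence that begins at the first codon position and stops at the first stop codon; it does not handle the alternative start codons that table 11 allows, which is not an issue here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order
# ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGCAGACA ATTGGACCCC GAGCAGCTGG"))  # MADNWTPSSW
```

The output matches the first ten residues of the protein sequence listed above.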