Gene Rleg2_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2007 
Symbol 
ID6980746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2066616 
End bp2067989 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content64% 
IMG OID643396729 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002281517 
Protein GI209549600 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGA ATTGGACCCC GAGCAGCTGG CGGCAAAAGC CCATCCTGCA GGTTCCCGAA 
TATCCGGATG CGGCCGCATT GGCAGCAACG GAGGCCACGC TCGCCAGCTA TCCGCCGCTC
GTTTTTGCCG GCGAGGCGCG CCGGCTGAAG AAGCATCTCG CCAACGTCGC CGAAGGCAAC
GGTTTCCTGC TGCAGGGCGG CGACTGCGCC GAGAGCTTCG CCGAACATGG CGCCGATAAT
ATTCGCGACT TCTTCCGCGC CTTCCTGCAG ATGGCCGTCG TGCTGACCTT CGGCGCACAG
CTGCCGGTCG TCAAGGTCGG CCGCATTGCC GGCCAGTTCG CCAAGCCGCG TTCATCGAAT
GTCGAAAAGC AGGGCGACGT GACACTGCCG GCCTATCGCG GCGACATCAT CAACGGCATC
GAGTTCACCG AGGAGTCGCG CATTCCGAAC CCGGAACGCC AGGCGATGGC CTATCGCCAG
TCGGCCGCGA CGCTGAACCT TCTGCGCGCC TTCGCGATGG GCGGCTACGC CAACCTCGAA
AACGTGCATC AGTGGATGCT CGGCTTCGTC AAGGACAGCC CGCAGGGCGA GCGTTACCGC
AAGCTTGCCG ACCGCATCAG CGAAACCATG GATTTCATGA AGGCGATCGG CATCACCTCG
GAAAACCACC CGAGCCTGCG CGAGACCGAT TTCTTCACCA GCCATGAGGC GCTTCTGCTC
GGCTACGAGG AGGCGCTGAC CCGCGTCGAT TCCACGTCGG GCGACTGGTA TGCCACATCG
GGCCATATGA TCTGGATCGG CGACCGTACG CGCCAGGCCG ACCATGCGCA TATTGAATAT
TGTCGCGGCA TCAAGAACCC GATCGGCCTC AAGTGCGGCC CATCGCTGCA GGCCGACGAT
CTGCTGCAGC TGATCGACAT CCTGAACCCG GCGAACGAAG CCGGGCGCCT GACGCTGATC
TGCCGTTTCG GCCATGAGAA GGTCGCCGAA AACCTGCCGC GCCTCATCCG CGCCGTCGAG
CGCGAGGGTC GCAAGGTCGT CTGGTCCTGC GACCCGATGC ACGGCAACAC CATCACGCTC
AACAACTACA AGACCCGGCC TTTCGAGCGG ATCCTGTCGG AAGTCGAAAG CTTCTTCCAG
ATCCACCGCG CCGAAGGCAC GCATCCCGGC GGCATCCATG TCGAAATGAC CGGCAAGGAT
GTGACGGAAT GCACCGGCGG CGCCCGTGCC GTCACCGCCG ACGATCTGCA GGACCGCTAC
CACACTCATT GCGATCCGCG CCTCAACTCC GACCAGGCGC TCGAGCTTGC CTTCCTGCTT
GCCGAGCGCA TGAAGGGCGG ACGCGACGAG AAGCGCATGG TCGCCCACGG CTGA
 
Protein sequence
MAENWTPSSW RQKPILQVPE YPDAAALAAT EATLASYPPL VFAGEARRLK KHLANVAEGN 
GFLLQGGDCA ESFAEHGADN IRDFFRAFLQ MAVVLTFGAQ LPVVKVGRIA GQFAKPRSSN
VEKQGDVTLP AYRGDIINGI EFTEESRIPN PERQAMAYRQ SAATLNLLRA FAMGGYANLE
NVHQWMLGFV KDSPQGERYR KLADRISETM DFMKAIGITS ENHPSLRETD FFTSHEALLL
GYEEALTRVD STSGDWYATS GHMIWIGDRT RQADHAHIEY CRGIKNPIGL KCGPSLQADD
LLQLIDILNP ANEAGRLTLI CRFGHEKVAE NLPRLIRAVE REGRKVVWSC DPMHGNTITL
NNYKTRPFER ILSEVESFFQ IHRAEGTHPG GIHVEMTGKD VTECTGGARA VTADDLQDRY
HTHCDPRLNS DQALELAFLL AERMKGGRDE KRMVAHG