Gene Rleg2_3595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3595 
SymbolaroB 
ID6982356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3720779 
End bp3721909 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content65% 
IMG OID643398320 
Product3-dehydroquinate synthase 
Protein accessionYP_002283088 
Protein GI209551171 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCGA TACCCTCCGC CTCATCAGTC CAGACGGTGC ACGTGCCGCT CGGCGAGCGC 
GCCTACGATA TCCTGATCGG GCCGGGGCTG ATCGCGCGGG CCGGCGCCGA AATCGCCTCC
CGCCTCAAGG GCCGCAAGGC GGCTGTCGTC ACCGATGAAA ATGTCGCGCC GCTCTATCTC
CAGGCGCTCG TCGCAAGTCT CGATGAAGCG GGCATCGCCT CGGCCGCGGT CGTCCTGCCG
GCCGGTGAGA AGACCAAGAG CTTCGAGCAT CTGATGACCG CCTGCGACAA GGTGCTCGAA
GCCCGCGTCG AGCGTAACGA TTGCGTCATC GCGCTCGGCG GCGGCGTTAT CGGCGACCTC
TCGGGATTTG CGGCCGGCAT CGTGCGGCGC GGCGTGCGCT TCGTGCAGGT GCCGACCTCG
CTGCTGGCGC AGGTCGATTC CTCCGTCGGC GGCAAGACCG GCATCAATTC CCGCCACGGC
AAGAACCTGA TCGGCGTCTT CCATCAGCCG GACCTGGTCC TGGCCGATAC CGATGTGCTG
AATACGCTAA GCGAGCGCGA ATTCCGCGCC GGTTACGCCG AGGTCGCCAA ATACGGGCTG
ATCGACAAGC CGGATTTTTT CGCTTGGCTG GAAGCCAACT GGAAGGCGGT TTTCACAGGC
GGCGCCGCCC GCATCGAGGC GATTGCCGCC AGCTGCCAGG CGAAGGCCGA TGTCGTCGTT
GCCGACGAGC GCGAGAACGG TCCGCGGGCG CTGCTCAACC TCGGCCATAC GTTCGGCCAT
GCGCTTGAAA CGGCGACAGC CTATGACAGC TCCCGTCTCG TGCATGGCGA GGGCGTTTCG
ATCGGCATGG TGCTGGCGCA CGAATTCTCT GCGCGGATGA ACCTTGCAAG CCCCGATGAT
GCGCGGCGCG TCGAGCGGCA TCTGCAGGAG GTCGGCCTTC CGACCCGGAT GTCCGACATT
CCGGGCGCGC TGCCGCCGGC CGAAACGCTG ATGGATGCGA TCGCCCAGGA CAAGAAGGTC
AAGAGCGGCA AGCTCACCTT CATCCTGACG CGCGGCATCG GTCAGTCCTT CGTCGCCGAC
GACGTTCCTG CCTCCGAGGT GATCAGCTTT CTCAGGGAAA AACACCCCTA A
 
Protein sequence
MNAIPSASSV QTVHVPLGER AYDILIGPGL IARAGAEIAS RLKGRKAAVV TDENVAPLYL 
QALVASLDEA GIASAAVVLP AGEKTKSFEH LMTACDKVLE ARVERNDCVI ALGGGVIGDL
SGFAAGIVRR GVRFVQVPTS LLAQVDSSVG GKTGINSRHG KNLIGVFHQP DLVLADTDVL
NTLSEREFRA GYAEVAKYGL IDKPDFFAWL EANWKAVFTG GAARIEAIAA SCQAKADVVV
ADERENGPRA LLNLGHTFGH ALETATAYDS SRLVHGEGVS IGMVLAHEFS ARMNLASPDD
ARRVERHLQE VGLPTRMSDI PGALPPAETL MDAIAQDKKV KSGKLTFILT RGIGQSFVAD
DVPASEVISF LREKHP