Gene TM1040_1432 details

Gene Information

Locus tag: TM1040_1432
Symbol: aroB
ID: 4078062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1529139
End bp: 1530260
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 64%
IMG OID: 638006742
Product: 3-dehydroquinate synthase
Protein accession: YP_613427
Protein GI: 99081273
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
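
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name cds_lengths is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    span = end_bp - start_bp + 1
    if span % 3 != 0:
        raise ValueError("CDS span is not a multiple of three")
    return span, span // 3 - 1


# Start bp and End bp as listed in the record.
gene_bp, protein_aa = cds_lengths(1529139, 1530260)
print(gene_bp, protein_aa)  # 1122 373
```

Under these assumptions the result agrees with the listed Gene Length (1122 bp) and Protein Length (373 aa).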


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.68619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.103295
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAAA CCGTTCACGT TCCCCTTGGC GCGCGCGCCT ATGATGTGGT GATCGGCCCC 
GATCTTGTTG CACAGGCGGG CCAGCGTATT GCGCCCCTCC TGCGCCGCAA GACAGTGGCT
GTGCTCACGG ATGAGACCGT GGCCGCGCTT CATCTTGAGG CTCTGCGCGC GGGACTCGCA
GCCGACGGCA TCGAGATGGA AGCACTTGCC TTGCCGCCCG GCGAGGCCAC TAAAGGCTGG
CCCCAGTTCA CCCGCGCGGT GGAGTGGCTC TTGGACAAGA AAGTCGAGCG CGGCGACATC
GTCATTGCCT TTGGCGGGGG CGTCATCGGC GATCTGGCGG GTTTTGCCGC AGCCGTGCTG
CGCCGGGGCG TTCGTTTTGT CCAGATCCCC ACATCCCTGC TGGCGCAGGT CGACAGTTCC
GTCGGGGGCA AAACCGGCAT CAACGCTCCG CAAGGCAAGA ACCTGATCGG CGCCTTCCAC
CAGCCCAGCC TGGTACTGGC CGATACAGCG GTTCTTGGCA CGCTCACAGA GCGCGATTTT
CTTGCCGGCT ACGGTGAGGT GGTGAAATAC GGGCTCTTGG GCGATGCGGC CTTTTTTGAC
TGGCTCGAAG AAAATGCCCC GGCAATGGCG GCAGGTGACA TGGCGCTGCG GGTCGAAGCC
GTGGCGCGTT CGGTTCAGAT GAAAGCCGAC ATCGTGGCCC GCGACGAAAC CGAACAAGGC
GACCGGGCGC TGTTGAACCT TGGTCATACC TTCTGTCACG CGCTGGAAGC GGCGACCGGC
TACAGCGACC GGTTGCTGCA TGGCGAAGGC GTGGCGATCG GCTGTGCGTT GGCCTTTGAG
CTCTCAGCCC GCCTCGGCCT CTGCAGTCAG GAAGATCCCA GCCGCGTGCG CGCGCACCTC
AAGGCGATGG GCATGAAAAC AGACCTCTCG GACATTCCCG GCGATCTTCC CCCCGCCCAA
GAGCTTCTGG ATCTCATGGC GCAGGACAAG AAGGTCGTGG ATGGTCAGCT GCGCTTCATC
CTCGCGCGCG GCATCGGAGC GGCCTTTGTC ACCGCCGATG TGCCCTCTGA AAAGGTGCTT
GAGGTGCTGC AAGAGGCGCT GGCGCATACA CAACCCGCCT GA
 
Protein sequence
MEQTVHVPLG ARAYDVVIGP DLVAQAGQRI APLLRRKTVA VLTDETVAAL HLEALRAGLA 
ADGIEMEALA LPPGEATKGW PQFTRAVEWL LDKKVERGDI VIAFGGGVIG DLAGFAAAVL
RRGVRFVQIP TSLLAQVDSS VGGKTGINAP QGKNLIGAFH QPSLVLADTA VLGTLTERDF
LAGYGEVVKY GLLGDAAFFD WLEENAPAMA AGDMALRVEA VARSVQMKAD IVARDETEQG
DRALLNLGHT FCHALEAATG YSDRLLHGEG VAIGCALAFE LSARLGLCSQ EDPSRVRAHL
KAMGMKTDLS DIPGDLPPAQ ELLDLMAQDK KVVDGQLRFI LARGIGAAFV TADVPSEKVL
EVLQEALAHT QPA
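
The protein sequence can be reproduced from the gene sequence by reading it codon by codon. The sketch below is one way to do that with the Python standard library only; the names CODON_TABLE and translate are hypothetical. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares, and it does not model table 11's alternative start codons (not needed here, since the gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence, as displayed.
print(translate("ATGGAACAAA CCGTTCACGT TCCCCTTGGC"))  # MEQTVHVPLG
print(translate("CAACCCGCCT GA"))                     # QPA
```

Both fragments match the corresponding ends of the protein sequence listed above; the final TGA codon is read as a stop and is not emitted.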