Gene GYMC61_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_0474 
SymbolaroB 
ID8524280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp476933 
End bp478033 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content59% 
IMG OID 
Product3-dehydroquinate synthase 
Protein accessionYP_003251638 
Protein GI261417956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAAC GGACGATTGA AACGGCGACG AAGCGCTATC CGCTTCTTTT GGGCGATGGG 
GCGGCCCGCG TGCTGCCAAG CTTGCTTCGG TCGCTCTCCT GTCCGCCAGG GACAAAACTT
TTCATCGTTA CCGACGATAC TGTGGCGCCT CTTTATTTGG ATGAGGTGCG GGCGTTGCTT
GCCGCCGCTG AGTATGACGT GTACGCCTAC GTCATTCCAA GCGGGGAGGC GGCCAAGTCA
TTTGATCATT ATTACGCTTG CCAGACAGCG GCATTGCAGT GCGGCCTCGA CCGCCGTTCG
GTTATCATTG CGCTTGGCGG CGGCGTTGTC GGCGATTTGG CTGGGTTTGT CGCCGCCACC
TATATGCGCG GCATCCGTTA CATTCAAATG CCGACGACAC TCTTGGCCCA TGACAGCGCC
GTCGGCGGCA AAGTCGCCAT CAACCATCCG CTCGGCAAAA ATATGATCGG CGCCTTCCAT
CAGCCGGAAG CAGTTGTGTA TGACACCTCC TTTTTGCGCA CGCTGCCCGA GCGCGAGCTT
CGGTCCGGGT TTGCCGAGGT GATCAAACAT GCGCTGATCC GCGACCGCCG CTTTTACGAC
TGGCTGCGCG CGGAAATCAA GACGCTCGCC GACTTGCGCG GCGAGAAACT CGCCTATTGC
ATTGAAAAGG GCATTGACAT TAAGGCGTCC GTCGTGCGTG AGGATGAAAA AGAAACCGGG
GTGCGCGCCC ATTTGAATTT CGGCCATACG CTCGGCCATG CGCTGGAAAG CGAGTTAGGC
TACGGCGCGC TCACTCACGG GGAGGCGGTT GCGGTTGGCA TGCTGTTTGC CGTCTTTGTC
AGCGAACGGT TTTACGGCCG GTCGTTCGCT GAGCATCGAT TGGCCGACTG GTTCGCCGGA
TACGGCTTCC CGGTGTCGCT GCCGACAACG GTTCAGACGC GCCGCCTGCT TGAGAAGATG
AAAGGCGACA AAAAAGCGTA CGCCGGAACA GTGCGGATGG TGCTTCTCTG TGAGATCGGC
GACGTGGAAG TGGTGGAACT CGAAGACGAC AACCTGCTCA CGTGGCTGGA CGAGTTTTCC
AGACAGGGGG GAAAAGGATG A
 
Protein sequence
MIERTIETAT KRYPLLLGDG AARVLPSLLR SLSCPPGTKL FIVTDDTVAP LYLDEVRALL 
AAAEYDVYAY VIPSGEAAKS FDHYYACQTA ALQCGLDRRS VIIALGGGVV GDLAGFVAAT
YMRGIRYIQM PTTLLAHDSA VGGKVAINHP LGKNMIGAFH QPEAVVYDTS FLRTLPEREL
RSGFAEVIKH ALIRDRRFYD WLRAEIKTLA DLRGEKLAYC IEKGIDIKAS VVREDEKETG
VRAHLNFGHT LGHALESELG YGALTHGEAV AVGMLFAVFV SERFYGRSFA EHRLADWFAG
YGFPVSLPTT VQTRRLLEKM KGDKKAYAGT VRMVLLCEIG DVEVVELEDD NLLTWLDEFS
RQGGKG