Gene GM21_3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3232 
Symbol 
ID8138584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3750687 
End bp3751778 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content65% 
IMG OID644870836 
Productchalcone and stilbene synthase domain protein 
Protein accessionYP_003023016 
Protein GI253701827 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3424] Predicted naringenin-chalcone synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGCA ATGTCTTCGT GGGCTCCATC GCCACCGTCG TTCCGCCGTT ATCCGTGGAT 
CAGCAGGAGG CGGCGGCGCT GATCAAGTCG CATTTCAAGG AGAGCCTCAC CGCGCGCGGC
CTCGGGCTGA TTCGCGCCAC CTTCAACCAT CCCAGCATCA AGAAAAGGCA TTTCGCCGTC
GATACCCCGG CGCGGATCTT CACCGAGACC CCCGATGAGC GGGTAGAGCG TTTCACCGAG
CAGGCGGTCC GGCTGGCGGA GCAGGCTGTG CTGCGGGCGC TTGATAAGGC TGGGGTGGGG
GTAAGGGAGG TGAACGGGCT GGTGCTGAAC ACCTGCACCG GCTACATCTG CCCCGGCCTT
TCCAGCTATG TCGCCGAGCG CCTGGGGCTT CGCTGCGACG CGAGGTTGTA CGACCTGGTG
GGGAGCGGCT GCGGCGGAGC GGTCCCCAAC CTGCAGGTGG CCGAGTCCAT GTTGAAGACG
ACCGGCGGCA TCGTGGTGAG CGTGTCGGTT GAGATCTGCA GCGCCGCCTT CCAGATGGGT
AACGACTTAA GCCTCATACT CTCCAACGCG CTCTTCGGCG ACGGCGCTGC GGCGGCCGTG
CTCTGGGAGA AGCCGGCCGG TTTCGAGTTG GTCGCCTCCG CCGGACGCTA CGTGCCGGAG
CAGCGCGAAG CGATCCGCTT CGTGCACCGG CAGGGACAGC TCCACAACCA GCTATCCACC
GACCTCCCGC AACTGGTAAG AAAGGCCGCG GCTCAGGTGG TCGCGGACCT TCTGGAAAGA
CATTCCCTCT CCATCGGCGA CATCGGCGGC TGGGCGCTCC ATACCGGCGG TGAAAAGATA
GTCAACGCGG TGCGGGACGA GATCGGGATC GACGAGTCGC AACTGTGGGC GACCCGGAAG
GTGCTGGAGC AGTACGGCAA CATGTCCTCG CCCACGGTCT GGTTCGTCTT GGATGAACTG
CTGCAGAACG GGATGCGCGA GGATGAGTGG TGCGTGATGC TCGCCTACGG CGCCGGGCTT
TCGGCGCACG CCTATTTGCT GAGAGGCTGG GGGCTGGGGG CTGGGGGCTG GGGGCTGGGG
GCTGGGCGCT AG
 
Protein sequence
MNSNVFVGSI ATVVPPLSVD QQEAAALIKS HFKESLTARG LGLIRATFNH PSIKKRHFAV 
DTPARIFTET PDERVERFTE QAVRLAEQAV LRALDKAGVG VREVNGLVLN TCTGYICPGL
SSYVAERLGL RCDARLYDLV GSGCGGAVPN LQVAESMLKT TGGIVVSVSV EICSAAFQMG
NDLSLILSNA LFGDGAAAAV LWEKPAGFEL VASAGRYVPE QREAIRFVHR QGQLHNQLST
DLPQLVRKAA AQVVADLLER HSLSIGDIGG WALHTGGEKI VNAVRDEIGI DESQLWATRK
VLEQYGNMSS PTVWFVLDEL LQNGMREDEW CVMLAYGAGL SAHAYLLRGW GLGAGGWGLG
AGR