Gene GM21_2435 details


Gene Information

Locus tag: GM21_2435
Symbol:
ID: 8137776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2841578
End bp: 2842735
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 63%
IMG OID: 644870045
Product: cycloartenol synthase-like protein
Protein accession: YP_003022236
Protein GI: 253701047
COG category:
COG ID:
TIGRFAM ID:
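
The start, end, gene length, and protein length listed above are consistent with each other if the coordinates are read as 1-based and inclusive, and if the stop codon counts toward the gene length but not the protein length. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of that arithmetic, with illustrative function names that belong to no database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length_bp(2841578, 2842735)
print(length_bp)                     # 1158
print(protein_length_aa(length_bp))  # 385
```

Both results match the Gene Length and Protein Length fields of the record.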


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 160
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAC TCTCCTTGCT TTTAACGACC GTCGTTGTCG CCCTCGGGAC GCTGATTACC 
CCCGCTTTCG CCGCCGAAAA ACAGAAGGGT GTAGCCGTAT CGAAGCAGGA CCTCTCGCTG
AAGCTGGAGG TTGAGAACGC CATAGGAAAG GGGCTCTCCT GGCTCGCCTT GCGCCAGAAT
TCGGCGGGGC ACTGGTCCCA GCCGGAATAC CCGGCGCTCT CGGCCCTGGT GCTCACCTCC
TTCCAGGGGG ACCCGTCGGG GTACTACAAA CGGAAGTACG AGCCGCAGAT CGGCAAGGGG
TACCGCTACC TCCTCTCCAA CGTCAGGCCC GACGGCGGCA TCTACGGCAA GGACCTCGCC
AACTACAACA CCTCCATCTC GATGATGGCG CTCCTTATGT CCAACAACCC CGAGTACGAG
CCGGTTCTCA AAAGGGCTCG CGGCTTCCTG GTCGGGCTCC AGGACAAGAG GGGGGACCAG
TTCGACGGCG GCATCGGCTA CGGCGGCAGC TACAAGAACT CGGACATAGT CAACACCTCC
TTCGCCCTGG AGGCGCTGCA CTACACCCGC TACCTGAAGA GCGACGTGGC GGGGGAGGCC
GAGGACCTGG ACTGGAAGAG GGCGGTCAGG TTCATCTCCC GCACCCAGAA CCTCCCCGGC
TATAACGACC AGAAGTGGGT CACCGGCGAC GCCGAGAACC GCGGCGGCTT CGTCTATTTC
CCCGGCGATT CGAAGGCGGG CGACAAGGTT CTCCCCGACG GCCGGGTGGC GCTCCGCTCC
TACGGCAGCG GCTCCTATGC CGGCCTCTTG AGCTACATCT ACGCGCAGAT GGACAAGAAC
GACCCCCGCG TGAAGGAGGT CTACAACTGG CTCACCGCGA ACTACACGCT TGAGGAAAAC
CCGGGGATGG GGCAGGAAGG GCTTTACTAC TACTACCACA CCATGGCCAA GGCCCTGAGC
ACCTACGGCG TGGACAGCAT CAAGCTCAAA AACGGCAAGA GCGTCAGCTG GCGCACCGAT
CTCGCCAAGC GCTTCCTCGA CCTGCAGAAG GAGGACGGCT CCTGGGTGAA CACCACCGGC
CGCTGGTGGG AGAGGGACCC CGTCCTGGTA ACCTCCTACG CCGTGCTCAC CCTGGAGATC
CTGCACCGGG GGCTGTGA
 
Protein sequence
MKKLSLLLTT VVVALGTLIT PAFAAEKQKG VAVSKQDLSL KLEVENAIGK GLSWLALRQN 
SAGHWSQPEY PALSALVLTS FQGDPSGYYK RKYEPQIGKG YRYLLSNVRP DGGIYGKDLA
NYNTSISMMA LLMSNNPEYE PVLKRARGFL VGLQDKRGDQ FDGGIGYGGS YKNSDIVNTS
FALEALHYTR YLKSDVAGEA EDLDWKRAVR FISRTQNLPG YNDQKWVTGD AENRGGFVYF
PGDSKAGDKV LPDGRVALRS YGSGSYAGLL SYIYAQMDKN DPRVKEVYNW LTANYTLEEN
PGMGQEGLYY YYHTMAKALS TYGVDSIKLK NGKSVSWRTD LAKRFLDLQK EDGSWVNTTG
RWWERDPVLV TSYAVLTLEI LHRGL
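
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following Python sketch shows one way to do that, to compute a GC fraction, and to translate the DNA. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation (the tables differ in permitted start codons). The helper names are hypothetical, and only the first and last rows of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = clean_sequence(
    "ATGAAGAAAC TCTCCTTGCT TTTAACGACC GTCGTTGTCG CCCTCGGGAC GCTGATTACC"
)
last_row = clean_sequence("CTGCACCGGG GGCTGTGA")

print(translate(first_row))  # MKKLSLLLTTVVVALGTLIT
print(translate(last_row))   # LHRGL
```

The two outputs match the first twenty and the last five residues of the protein sequence listed above. Running `gc_fraction` on the complete, cleaned gene sequence should give a value comparable to the GC content field in the record.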