Gene GM21_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2118 
Symbol 
ID8137454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2475222 
End bp2476862 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content65% 
IMG OID644869733 
Productpeptide synthase 
Protein accessionYP_003021928 
Protein GI253700739 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGAGA GCGAACTGGT CAATATTGCC GCGCACCTGC CGGAGATGGC GAAGCGCCAG 
CCGGATACCA GGGCCATCAT CTTCCCCAAG CAAAACCGGA GCTTGAGCTT CTCCGAGTTC
AACACGCTTA GCGACCGGAT CGCGCGGGGG CTGATCGCCA ACGGTATTTG CCGCGGCGTG
CGCACCGTGC TTATGGTGAC GCCGAGCCCC GAGTTCTTCG CTCTTACCTT CGCGCTCTTC
AAGGTGGGCG CGGTGCCGGT ACTGATCGAC CCGGGGCTGG GGATCAAGAA CCTGAAGCAG
TGCTTTGCCG AGGCGCAGCC GCACGCCTTC ATCGGCATCC CCAAAGCGCA CCTGGCGCGG
CTTATCTTCG GCTGGGGCAA GGAGACCATC CGGACCTTCA TCACTGTAGG CCCGCGACTT
TTCTGGGGTG GAACCACACT CGCCAGAATC ATCGAGGAGC ACACCGACGC ATCCTCATTC
GTCCCCGCCC CGACCGGCTC GGAGGACGTC GCCGCCATCC TCTTCACCAG CGGCAGCACC
GGGGTCCCCA AGGGGGCCGT CTACAGTCAC GGCAATTTCG CGGCGCAGGT GCAGGCACTG
AAACAGGTCT ACGGCATCGA ACCGGGCGAG ATCGACCTCC CCACCTTCCC GCTCTTCGCC
CTCTTCGCCC CCGCCCTCGG CATGACCGCC GTCATCCCGG AGATGGACTT CACCAGGCCC
GGATCGGTGA ACCCGAAGAA GATCGTCGGC GCCATCCACA CCTACGGCGT CACCACCATG
TTCGGCTCGC CCGCCCTCAT CAACCGGGTC GGGCGCTACG GGGTGCAGCA CCAGGTGAAA
CTCCCGACCT TGCGGCGCGC CATCTCCGCC GGCGCCCCGG TGTCCGCGAC GGTATTGGAG
CGCTTCACTT CGCTTCTCAA CCCCGGCGTG CAGGTCTTCA CCCCTTACGG CGCCACCGAG
GCGCTCCCCG TCTGCTCCAT AGGCAGCACC GAGATCCTGG AGGCGACCCG TCAGATAACG
GACGCCGGCG GCGGGGTCTG CGTCGGCAGG CCGGTCGAGG GGATCCGGCT GGAGATCATC
CAGATCAGCG ACGACCCCAT CTGCTGCTGG CACGATTCGC TGCGGGTGCC CACCGGGAAG
ATCGGCGAGA TCGTGGTGCA GGGGGAGCAG GTTACAAGGG GGTACTACAA CCGCCCTGAA
TCGGACCACC TCTCTAAGAT CCCCGATCCC GAGACCGGCT CCTTCTTCCA CCGCATGGGG
GACTTAGGCG GGCGCGACGA GGAAGGAAGG GTCTGGTTCT GCGGACGCAA GTCGCATCGC
GTGGAGACCG AGTCCGGGCC GCTTTACACC ATCCCCTGCG AGGCGGTGTT CAACGCCCAT
CCGGCCGTGT TCCGTACCGC GCTGGTAGGG GTGGGGACGC CGGGCGAGCT GATGCCCGTG
CTCTGCGTGG AGCTGGAAAA GGAGATCAAG ACGGACCCGG AACTGGTGCG GGCCGAGCTC
CTCTCGTTCG CACAGGACCA CATCCACACC AAGGGGATCG AAACCATACT GTTCCACCCG
GCCTTTCCGG TGGATATCCG TCACAACGCC AAGATCTTCC GGGAGAAGCT GGCGGTATGG
GCGGCCGCGA GGCTCAAATG A
 
Protein sequence
MLESELVNIA AHLPEMAKRQ PDTRAIIFPK QNRSLSFSEF NTLSDRIARG LIANGICRGV 
RTVLMVTPSP EFFALTFALF KVGAVPVLID PGLGIKNLKQ CFAEAQPHAF IGIPKAHLAR
LIFGWGKETI RTFITVGPRL FWGGTTLARI IEEHTDASSF VPAPTGSEDV AAILFTSGST
GVPKGAVYSH GNFAAQVQAL KQVYGIEPGE IDLPTFPLFA LFAPALGMTA VIPEMDFTRP
GSVNPKKIVG AIHTYGVTTM FGSPALINRV GRYGVQHQVK LPTLRRAISA GAPVSATVLE
RFTSLLNPGV QVFTPYGATE ALPVCSIGST EILEATRQIT DAGGGVCVGR PVEGIRLEII
QISDDPICCW HDSLRVPTGK IGEIVVQGEQ VTRGYYNRPE SDHLSKIPDP ETGSFFHRMG
DLGGRDEEGR VWFCGRKSHR VETESGPLYT IPCEAVFNAH PAVFRTALVG VGTPGELMPV
LCVELEKEIK TDPELVRAEL LSFAQDHIHT KGIETILFHP AFPVDIRHNA KIFREKLAVW
AAARLK