Gene GM21_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2199 
Symbol 
ID8137535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2569681 
End bp2571327 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content63% 
IMG OID644869814 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_003022009 
Protein GI253700820 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.0000113642 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGATT TGCTGATCCC CCGCACCGCG TCCGCCTACG ACTACCCTTT GCTGATAAAG 
AACCTGTTGC TGTACCCGGT GGTCGACAAC CCCGACCAGG AGATCGTCTA CCGCGACCTG
TACCGGGGCA ACTACCGCCA ACTGAGGGAG CGGGTGAAGC GGTTGGCCAA CATGCTCACC
GGGCTTGGCG TGAAGCCGGG GCAAACGGTG GCCGTTATGG ACTGGGACAG CCATCGCTAC
CTGGAGCTCT TCTTCGCGGT GCCGATGATC GGCGCGGTGC TCCACACCAT CAACGTGCGC
CTCTCCGCGG AGCAGATCCT CTACACCATC GACCATGCCG AGGACGACGT GCTGCTGGTC
AACAGCGAGT TTCTCCCCAT CATGGAGCAG ATCCGCGGCA GGATCGACAA CGTCCGCACC
TATATCCTCA TCTCCGACGA CGGCATGACG GAATGCAGCA CCATCCCCGC CTGCGGCGAG
TACGAACAGC TCCTGGCCCA GGCCTCGCCG GAGTTCGAAT TCCCCGATCT GGACGAGAAC
ACCAGGGCCA CGACCTTCTA CACCACCGGG ACCACGGGGA TGCCGAAGGG GGTCTATTTC
AGCCACCGGC AACTGGTGCT CCATTCCCTG GGGCTTTTGG CGACGCTCGG TTCCTCCACC
TCGCACGCCT GCCTGCACCG CGATGACGTC TACATGCCGA TAACGCCAAT GTTCCACGTC
CATGCCTGGG GGGTCCCCTA TATCGCCACG ATGCTGGGGG TGAAGCAGGT CTATCCCGGT
CGCTACCTCC CGGAGACCCT GCTGGAGCTC AAAGAGAAGG AAGGAGTCAC CTTCTCCCAT
TGCGTTCCGA CCATCTTGCA TATGCTCTTG AAGCACCCCC ACGCGGAAAA GATCGACCTG
CGGGGCTGGA AGCTCATCAT CGGCGGCGCG GCCTTGTCGC GCAACCTCTG CGTCGAGTCC
CTGAAGCTTG GGATCGACGT CTTCACCGGG TACGGGATGT CCGAGACCTG CCCGATCCTC
ACCATTTCCA AGCTCACCCC GGAGATGCTG GAGCTCTCCC ACGCGGAGCA GGCGGAGATC
CGCTGCAAGA CTGGCCTTGC TCTGGCGTTC GTCGATCTGC GCGTGGTCGA CAGCGACTTC
AACGAGCTCC CCCGCGACGG CGTCAGCGCC GGCAACGTGG TGGTCCGCTC CCCCTGGCTC
ACCCAGGGAT ACCTGAAGGA CCACAAGGCC TCCGAGCGTC TCTGGGAGGG AGGGTATCTC
CATACCGGCG ACGTGGCGGT GCGGGACGAA CTGGGCTATC TGAAGATCAC CGACCGGAGC
AAGGACGTGA TCAAGGTCGC CGGCGAATGG GTTTCCTCGC TGGAGCTTGA GGACATCGTC
GCGCACCACC CCGCGGTAGC CGAGGTGGCG GTGATAGGGA AGCCCGACGA GAAGTGGGGC
GAGCGCCCCC TGGCGCTGGT CGTTCTCAAG CCGACGGAGG GGACGAAGGT AACCGATAAG
GAGATCGCCC ACCACGTGAG GGAGTACGCA GACAAGGGTG TGGTGAGCAA GCAGGTCGTT
CTGGTCAAGG TGAAGCTCGT TCCCTCCATC GACAAGACCA GCGTGGGGAA GATCAACAAG
GTGGCGCTGC GGGAGAAATA TCTCTAA
 
Protein sequence
MSDLLIPRTA SAYDYPLLIK NLLLYPVVDN PDQEIVYRDL YRGNYRQLRE RVKRLANMLT 
GLGVKPGQTV AVMDWDSHRY LELFFAVPMI GAVLHTINVR LSAEQILYTI DHAEDDVLLV
NSEFLPIMEQ IRGRIDNVRT YILISDDGMT ECSTIPACGE YEQLLAQASP EFEFPDLDEN
TRATTFYTTG TTGMPKGVYF SHRQLVLHSL GLLATLGSST SHACLHRDDV YMPITPMFHV
HAWGVPYIAT MLGVKQVYPG RYLPETLLEL KEKEGVTFSH CVPTILHMLL KHPHAEKIDL
RGWKLIIGGA ALSRNLCVES LKLGIDVFTG YGMSETCPIL TISKLTPEML ELSHAEQAEI
RCKTGLALAF VDLRVVDSDF NELPRDGVSA GNVVVRSPWL TQGYLKDHKA SERLWEGGYL
HTGDVAVRDE LGYLKITDRS KDVIKVAGEW VSSLELEDIV AHHPAVAEVA VIGKPDEKWG
ERPLALVVLK PTEGTKVTDK EIAHHVREYA DKGVVSKQVV LVKVKLVPSI DKTSVGKINK
VALREKYL