Gene GM21_2035 details

Gene Information

Locus tag: GM21_2035
Symbol:
ID: 8137371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2359123
End bp: 2360427
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 62%
IMG OID: 644869650
Product: Phenylacetate--CoA ligase
Protein accession: YP_003021845
Protein GI: 253700656
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1541] Coenzyme F390 synthetase
TIGRFAM ID: [TIGR02155] phenylacetate-CoA ligase
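
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the reported 1305 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2359123, 2360427)
print(length)                           # 1305
print(expected_protein_length(length))  # 434
```

Both results agree with the Gene Length and Protein Length fields of this record.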


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.00000000000365841
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCTTCA ACGAGGAGTT CGAGACGCTT CCCAGGGAGG CTATCGAGGC ACTGCAGCTT 
AAAAGGCTCA AGGCGATGGT GGCGCGCGTT CAACAAAACG TCCCCTTCTA CAAGGAGTCG
CTGGCCAAGG CGGGAGTCGG CGCCGATTCC ATCAAGTCGC TTTCGGACCT GGCCCGGCTC
CCCTTCACCT ACAAGCAGGA CATGCGCGAC TCCTACCCGT ACCGCCTCTT CGCAGTGCCG
ATGGAGGACA TCGTCCGCAT CCACGCCTCT TCCGGCACCA CCGGCAAACC CACGGTGGTC
GGCTACACCC AGAAGGACAT CGACACCTGG AGCGAGCTGA TGGCGCGCTC GTTCGTCGCA
GCCGGGGTGC ACAAGGGCGA CATCATCCAC AACTCCTACG GCTACGGCCT CTTCACCGGC
GGCCTGGGCG CGCACTACGG CGCTGAGCGG CTGGGGGCGT CCGTCATTCC GATGTCAGGG
GGTAACACCA AGAAACAGAT CATGATCATG CAGGACTTCG GTTCCACCGT CCTCACCTGC
ACCCCTTCCT ATTCGCTCTA CATGGCGGAG GCCGCCAAGG AGGAGGGGGT CGACTTCCGC
GATCTGAAGC TCAAAGTCGG CATCTTCGGC GCCGAGCCCT GGTCCGAGGC GATGCGCCTC
GACATCGAGG AGAAGCTGAA TCTCTCCGCC GTCGACATCT ACGGGCTCTC GGAAATCATG
GGACCCGGCG TCGCCATCGA GTGCTGCGAG GCGAAACAGG GGCTCCACGT CTGGGAGGAT
CACTTCATCC CCGAGATCAT CAACCCCGAG ACCGGCGAAG TGCTTCCCGA AGGCGCTAAG
GGGGAGCTGG TCATCACCAC CATCACCAAG GAAGGGATCC CGCTGATCCG CTACCGGACC
CGCGACATCA CCTCCATCAC CTACGAGCCC TGCATCTGCG GCAGGACCCA TGCCCGCATC
GCCCGCATGA GCGGCAGAAG CGACGACATG CTGATCATCC GCGGAGTCAA CGTCTTCCCG
TCGCAGATCG AGGCGATCCT CATGGGGGTC GAAGGGGTCG AGCCGCACTA CGTCCTCATC
GTGGATAGAA AGGACAACCT GGACACCCTC GAGGTGCAGG TCGAGGTGGG CGAGGACATC
TTCTCCGACG AGATCAAGCA CCTCCAGGCG CTCTCGACCA AGATCGAGAA GCAGATCAAG
GAGATGCTGG GGGTCACCTG CCGCGTCAGG CTCGTGGAAC CCAAGAGCAT CACCCGCAGC
GAAGGCAAAG CCAAGAGGGT CATCGACAAC AGGAACAAAG CCTAA
 
Protein sequence
MFFNEEFETL PREAIEALQL KRLKAMVARV QQNVPFYKES LAKAGVGADS IKSLSDLARL 
PFTYKQDMRD SYPYRLFAVP MEDIVRIHAS SGTTGKPTVV GYTQKDIDTW SELMARSFVA
AGVHKGDIIH NSYGYGLFTG GLGAHYGAER LGASVIPMSG GNTKKQIMIM QDFGSTVLTC
TPSYSLYMAE AAKEEGVDFR DLKLKVGIFG AEPWSEAMRL DIEEKLNLSA VDIYGLSEIM
GPGVAIECCE AKQGLHVWED HFIPEIINPE TGEVLPEGAK GELVITTITK EGIPLIRYRT
RDITSITYEP CICGRTHARI ARMSGRSDDM LIIRGVNVFP SQIEAILMGV EGVEPHYVLI
VDRKDNLDTL EVQVEVGEDI FSDEIKHLQA LSTKIEKQIK EMLGVTCRVR LVEPKSITRS
EGKAKRVIDN RNKA
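
The relationship between the gene sequence and the protein sequence above can be checked by translating the DNA codon by codon. The sketch below is an illustration, not the tool used to produce this record. It assumes the sequence is given 5' to 3' in the coding frame, uses the codon assignments of translation table 11 (which match the standard code), and does not handle the alternative start codons that table 11 allows, since this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGTTCTTCA ACGAGGAGTT CGAGACGCTT"
print(translate(first_blocks))  # MFFNEEFETL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.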