Gene GM21_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2043 
Symbol 
ID8137379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2367157 
End bp2368458 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID644869658 
Productphenylacetate-CoA ligase 
Protein accessionYP_003021853 
Protein GI253700664 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00000000000000158285 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAATTT GGGATCCGGA TTACGAATGC ATGCCGCGCG AGGAGATGGA GCAGCTCCAG 
CTGGAGCGCC TCCAGGCCAC CCTCAACCGC GTGTACAAAA ACGTCACCTG CTACCGGAAC
AAGTTCAAGG AACTGGGAAT CGTCCCCGAG GATGTCACAT CCCTCGCCGA CCTCTCGAAG
CTTCCCTTCA CCACCAAGGA AGACCTGCGC CTCAACTACC CCTACGGCAT GTTCGCGGTG
CCGCTTCGGG AGGTGGTGCG CATCCATTCC TCCAGCGGCA CCACCGGCAA ACCCACCGTC
GTCGGCTACA CCAAGCAGGA CGTGAAGGTC TGGTCCAACC TGGTGGCGCG CTTCATGACG
GCAGCCGGGG TCAACCACGA CGACGTGGTG CAGATAGCAT TCGGCTACGG CCTCTTCACT
GGCGCTTTCG GCCTTCACTA CGGCTCGGAG ATGATCGGCG CCTCGGTCAT CCCGATGGGC
GCGGGGAACA CCGAGAAGCA GATCATGATC ATGCAGGACT ACCGCACCAC CGCCCTCGTC
TCGACACCTA GCTACGCGGT GACCATAGCC GAGCGCATGG AAAAGATGGG GATCGACCCG
AAAAGCCTCT GCCTCAAGGT GGGGCTCTTC GGCGGCGAGC CCTGGTCCGA GGCGATGCGC
CGCGAGATCG AGAGCAGGCT CGGCATCTCG GCCACCGACA ACTACGGGCT TTCCGAGGTG
ATCGGACCGG GTGTCGCCGG CGAATGCCAG TGCAAGTGCG GCATGCACAT CTCCGAGGAC
GCTTTCCTCG CCGAGATCAT CGACCCCGAT ACCGGCAAGA CGCTCCCGCC GGGAAGCGTA
GGCGAACTGG TGCTCACCTC GCTCACCAAG GAAGCGTTCC CGATGGTGCG CTACCGCACC
CGCGACATCA CCTCGCTCGA CTACACAAAG TGCGACTGCG GCAGGACCAC GGTGCGCATG
AAGAAAACCA TGGGACGCTC CGACGACATG CTGATCATCA AGGGGGTGAA CGTCTACCCG
TCCCAGATAG AAGACGTCCT CTTCGCCGTC GAAGGATGCC AGCCGCACTA CCAGTTGGTG
GTCGACCGCA AAGGCGCGCT GGATACCCTG GAGATCAGGA TAGAGGTGAC CGAGAACATC
TTCTTCGACG AGATGAAGCT GCAGAAGGCT TTCCTAGACA ACGTCGAGCG GCGCATTGAC
TCAGTGCTCG GTGTCGGCGC CGTGGTGAAA CTGGTCGAGC CTAACAGCAT CCCGAGGGCC
GAAGGTAAAG CCTCCAGAGT CATTGACAAC AGAAAAATCT AG
 
Protein sequence
MSIWDPDYEC MPREEMEQLQ LERLQATLNR VYKNVTCYRN KFKELGIVPE DVTSLADLSK 
LPFTTKEDLR LNYPYGMFAV PLREVVRIHS SSGTTGKPTV VGYTKQDVKV WSNLVARFMT
AAGVNHDDVV QIAFGYGLFT GAFGLHYGSE MIGASVIPMG AGNTEKQIMI MQDYRTTALV
STPSYAVTIA ERMEKMGIDP KSLCLKVGLF GGEPWSEAMR REIESRLGIS ATDNYGLSEV
IGPGVAGECQ CKCGMHISED AFLAEIIDPD TGKTLPPGSV GELVLTSLTK EAFPMVRYRT
RDITSLDYTK CDCGRTTVRM KKTMGRSDDM LIIKGVNVYP SQIEDVLFAV EGCQPHYQLV
VDRKGALDTL EIRIEVTENI FFDEMKLQKA FLDNVERRID SVLGVGAVVK LVEPNSIPRA
EGKASRVIDN RKI