Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2043 |
Symbol | |
ID | 8137379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2367157 |
End bp | 2368458 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869658 |
Product | phenylacetate-CoA ligase |
Protein accession | YP_003021853 |
Protein GI | 253700664 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1541] Coenzyme F390 synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00000000000000158285 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAATTT GGGATCCGGA TTACGAATGC ATGCCGCGCG AGGAGATGGA GCAGCTCCAG CTGGAGCGCC TCCAGGCCAC CCTCAACCGC GTGTACAAAA ACGTCACCTG CTACCGGAAC AAGTTCAAGG AACTGGGAAT CGTCCCCGAG GATGTCACAT CCCTCGCCGA CCTCTCGAAG CTTCCCTTCA CCACCAAGGA AGACCTGCGC CTCAACTACC CCTACGGCAT GTTCGCGGTG CCGCTTCGGG AGGTGGTGCG CATCCATTCC TCCAGCGGCA CCACCGGCAA ACCCACCGTC GTCGGCTACA CCAAGCAGGA CGTGAAGGTC TGGTCCAACC TGGTGGCGCG CTTCATGACG GCAGCCGGGG TCAACCACGA CGACGTGGTG CAGATAGCAT TCGGCTACGG CCTCTTCACT GGCGCTTTCG GCCTTCACTA CGGCTCGGAG ATGATCGGCG CCTCGGTCAT CCCGATGGGC GCGGGGAACA CCGAGAAGCA GATCATGATC ATGCAGGACT ACCGCACCAC CGCCCTCGTC TCGACACCTA GCTACGCGGT GACCATAGCC GAGCGCATGG AAAAGATGGG GATCGACCCG AAAAGCCTCT GCCTCAAGGT GGGGCTCTTC GGCGGCGAGC CCTGGTCCGA GGCGATGCGC CGCGAGATCG AGAGCAGGCT CGGCATCTCG GCCACCGACA ACTACGGGCT TTCCGAGGTG ATCGGACCGG GTGTCGCCGG CGAATGCCAG TGCAAGTGCG GCATGCACAT CTCCGAGGAC GCTTTCCTCG CCGAGATCAT CGACCCCGAT ACCGGCAAGA CGCTCCCGCC GGGAAGCGTA GGCGAACTGG TGCTCACCTC GCTCACCAAG GAAGCGTTCC CGATGGTGCG CTACCGCACC CGCGACATCA CCTCGCTCGA CTACACAAAG TGCGACTGCG GCAGGACCAC GGTGCGCATG AAGAAAACCA TGGGACGCTC CGACGACATG CTGATCATCA AGGGGGTGAA CGTCTACCCG TCCCAGATAG AAGACGTCCT CTTCGCCGTC GAAGGATGCC AGCCGCACTA CCAGTTGGTG GTCGACCGCA AAGGCGCGCT GGATACCCTG GAGATCAGGA TAGAGGTGAC CGAGAACATC TTCTTCGACG AGATGAAGCT GCAGAAGGCT TTCCTAGACA ACGTCGAGCG GCGCATTGAC TCAGTGCTCG GTGTCGGCGC CGTGGTGAAA CTGGTCGAGC CTAACAGCAT CCCGAGGGCC GAAGGTAAAG CCTCCAGAGT CATTGACAAC AGAAAAATCT AG
|
Protein sequence | MSIWDPDYEC MPREEMEQLQ LERLQATLNR VYKNVTCYRN KFKELGIVPE DVTSLADLSK LPFTTKEDLR LNYPYGMFAV PLREVVRIHS SSGTTGKPTV VGYTKQDVKV WSNLVARFMT AAGVNHDDVV QIAFGYGLFT GAFGLHYGSE MIGASVIPMG AGNTEKQIMI MQDYRTTALV STPSYAVTIA ERMEKMGIDP KSLCLKVGLF GGEPWSEAMR REIESRLGIS ATDNYGLSEV IGPGVAGECQ CKCGMHISED AFLAEIIDPD TGKTLPPGSV GELVLTSLTK EAFPMVRYRT RDITSLDYTK CDCGRTTVRM KKTMGRSDDM LIIKGVNVYP SQIEDVLFAV EGCQPHYQLV VDRKGALDTL EIRIEVTENI FFDEMKLQKA FLDNVERRID SVLGVGAVVK LVEPNSIPRA EGKASRVIDN RKI
|
| |