Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0388 |
Symbol | |
ID | 5103631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 335604 |
End bp | 336887 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640506294 |
Product | phenylacetate-CoA ligase |
Protein accession | YP_001190489 |
Protein GI | 146303173 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1541] Coenzyme F390 synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.128203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGATG AGACTGACCC AAGGGCCTTA ACCAAGGAAG AGATCGAGAA CGTGCAGGCT TTCAGGCTCA GGCGGGCGTT GAGGAGGGCC TACGAGAGAA GTCCCTTCTA CAGGAGGATC TTTAAGGAGA GAAATCTGAC TCCCGACGAC ATTCGCACGA AAGAGGACCT AAGGAAGCTA CCTTTCACCA CTAAGACCGA CCTGAGGGAG AAGGGCTACC CATACGGAGG CGAGTTCATG ACGGTGGAAC TGGAGGAAAT AGTGGGCTGG CATATGACCA GCGGGACCAC TGGGGTTCCC ACAGTTGGGG CTTACACTTC CTCGGATGTG GAGTTGTGGG CTAACCTCGT TGCGAGAAGT CTCAGGACAG CTGGTGTCAC GCGGAAGGAC ATAATTGCAA ACGTTTACGG CTACGGTCTA TTCACAGGCG GAATGGGACT TCACTTGGGG GCGCAGAAGA TAGGGGCAAA GGTAATCCCT TGGAGCACTG GGAGGACAGA GGCACTGGCC AAGACATTGA AGGACTTCAG GGCCACGGTA ATCACTGGAA CACCCTCATA CGAGCTGGTA ATAGCCGAGA AGGTAAGGGA GGCAGGGCTT GACCCGGAGA GGGACCTGAC GTTAAGGCTC GCAATACCTG GTGCAGAGTC AATGACCCCG GAGATGTTGA GGAGGATAGA GAAGGAACTG GGTCTCCTGA GCAGGGGAGG AGGTGCCAGG GAGATATACG GGCTCACCGA GGCTATAGGT CCTGGGGTAG CTCAGGAATG CCCCCACGAC AATCACGAGT TCATGCATAT CTGGACTGAT CACTTCCTAG TGGAGATAAT AGATCCAGAC ACGGGAGAGA ACGTGGGTGA GGGCGAGGAG GGAGAGATGG TTTTCACTCA CCTCACCAGG GAGGGAATGC CACTAATTAG GTATAGAACG AGGGACATTA CCAGGTTGGT GGAGAGCGAC GACGACATCC CCTTCCCAAA GGTCGCAATA ATGAAGGGAA GGTCAGACGA CGTCATTTTC TACAAGGGGG TGAAGCTTTA TCCCACGGCC ATTAATGAGG TACTCATGAA GATGCCCGAG GTCATGGAGT ATCAGATGGT GATAACTAAG GACCCGCAGA AGTTTCTTCT CCTGGTGGAG ACTACTTCTC CCTCCGAGGA TCTTAGAAGG CGGATAGTTA CGGACATCAA GAACGTTACC TTCGTCAATC CCGAGGTTGA CTTCGTGTCC CCTGGAACCC TTCCCAGGTT CGAGGGCAAG TCCAAGAGGG TGGTTCTGAA GTGA
|
Protein sequence | MYDETDPRAL TKEEIENVQA FRLRRALRRA YERSPFYRRI FKERNLTPDD IRTKEDLRKL PFTTKTDLRE KGYPYGGEFM TVELEEIVGW HMTSGTTGVP TVGAYTSSDV ELWANLVARS LRTAGVTRKD IIANVYGYGL FTGGMGLHLG AQKIGAKVIP WSTGRTEALA KTLKDFRATV ITGTPSYELV IAEKVREAGL DPERDLTLRL AIPGAESMTP EMLRRIEKEL GLLSRGGGAR EIYGLTEAIG PGVAQECPHD NHEFMHIWTD HFLVEIIDPD TGENVGEGEE GEMVFTHLTR EGMPLIRYRT RDITRLVESD DDIPFPKVAI MKGRSDDVIF YKGVKLYPTA INEVLMKMPE VMEYQMVITK DPQKFLLLVE TTSPSEDLRR RIVTDIKNVT FVNPEVDFVS PGTLPRFEGK SKRVVLK
|
| |