Gene Msed_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0388 
Symbol 
ID5103631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp335604 
End bp336887 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content54% 
IMG OID640506294 
Productphenylacetate-CoA ligase 
Protein accessionYP_001190489 
Protein GI146303173 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.128203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGATG AGACTGACCC AAGGGCCTTA ACCAAGGAAG AGATCGAGAA CGTGCAGGCT 
TTCAGGCTCA GGCGGGCGTT GAGGAGGGCC TACGAGAGAA GTCCCTTCTA CAGGAGGATC
TTTAAGGAGA GAAATCTGAC TCCCGACGAC ATTCGCACGA AAGAGGACCT AAGGAAGCTA
CCTTTCACCA CTAAGACCGA CCTGAGGGAG AAGGGCTACC CATACGGAGG CGAGTTCATG
ACGGTGGAAC TGGAGGAAAT AGTGGGCTGG CATATGACCA GCGGGACCAC TGGGGTTCCC
ACAGTTGGGG CTTACACTTC CTCGGATGTG GAGTTGTGGG CTAACCTCGT TGCGAGAAGT
CTCAGGACAG CTGGTGTCAC GCGGAAGGAC ATAATTGCAA ACGTTTACGG CTACGGTCTA
TTCACAGGCG GAATGGGACT TCACTTGGGG GCGCAGAAGA TAGGGGCAAA GGTAATCCCT
TGGAGCACTG GGAGGACAGA GGCACTGGCC AAGACATTGA AGGACTTCAG GGCCACGGTA
ATCACTGGAA CACCCTCATA CGAGCTGGTA ATAGCCGAGA AGGTAAGGGA GGCAGGGCTT
GACCCGGAGA GGGACCTGAC GTTAAGGCTC GCAATACCTG GTGCAGAGTC AATGACCCCG
GAGATGTTGA GGAGGATAGA GAAGGAACTG GGTCTCCTGA GCAGGGGAGG AGGTGCCAGG
GAGATATACG GGCTCACCGA GGCTATAGGT CCTGGGGTAG CTCAGGAATG CCCCCACGAC
AATCACGAGT TCATGCATAT CTGGACTGAT CACTTCCTAG TGGAGATAAT AGATCCAGAC
ACGGGAGAGA ACGTGGGTGA GGGCGAGGAG GGAGAGATGG TTTTCACTCA CCTCACCAGG
GAGGGAATGC CACTAATTAG GTATAGAACG AGGGACATTA CCAGGTTGGT GGAGAGCGAC
GACGACATCC CCTTCCCAAA GGTCGCAATA ATGAAGGGAA GGTCAGACGA CGTCATTTTC
TACAAGGGGG TGAAGCTTTA TCCCACGGCC ATTAATGAGG TACTCATGAA GATGCCCGAG
GTCATGGAGT ATCAGATGGT GATAACTAAG GACCCGCAGA AGTTTCTTCT CCTGGTGGAG
ACTACTTCTC CCTCCGAGGA TCTTAGAAGG CGGATAGTTA CGGACATCAA GAACGTTACC
TTCGTCAATC CCGAGGTTGA CTTCGTGTCC CCTGGAACCC TTCCCAGGTT CGAGGGCAAG
TCCAAGAGGG TGGTTCTGAA GTGA
 
Protein sequence
MYDETDPRAL TKEEIENVQA FRLRRALRRA YERSPFYRRI FKERNLTPDD IRTKEDLRKL 
PFTTKTDLRE KGYPYGGEFM TVELEEIVGW HMTSGTTGVP TVGAYTSSDV ELWANLVARS
LRTAGVTRKD IIANVYGYGL FTGGMGLHLG AQKIGAKVIP WSTGRTEALA KTLKDFRATV
ITGTPSYELV IAEKVREAGL DPERDLTLRL AIPGAESMTP EMLRRIEKEL GLLSRGGGAR
EIYGLTEAIG PGVAQECPHD NHEFMHIWTD HFLVEIIDPD TGENVGEGEE GEMVFTHLTR
EGMPLIRYRT RDITRLVESD DDIPFPKVAI MKGRSDDVIF YKGVKLYPTA INEVLMKMPE
VMEYQMVITK DPQKFLLLVE TTSPSEDLRR RIVTDIKNVT FVNPEVDFVS PGTLPRFEGK
SKRVVLK