Gene Msed_1399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1399 
Symbol 
ID5104609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1370105 
End bp1372087 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content47% 
IMG OID640507288 
ProductCoA-binding domain-containing protein 
Protein accessionYP_001191481 
Protein GI146304165 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID[TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.578078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000299945 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGCTAG ATTACCTCTT TAGGCCTAGA ACCATAGCGG TCGTAGGAGC TTCACGCAAC 
AGGGAGAAAG TTGGAAACGT GATTTTTAGA AACGTCGCAT CTACTTTTCG CGGTAAGGTT
TATCCTGTCA ACAATAAATC AGAAACCATC GAGGGTGTCC AATCTTATAA GTCCTTGAAG
GATATACAAG ACGACGTTGA TCTTGCCATC ATTTCTGTGC CAAGGGAGTC AGTTCCAGGG
GTTATGGAAG AGGCTGGTGA GAAAGGCGTA AGGTCGGCCA TAGTTATCAC CTCCGGCTTT
AGGGAAGTAG GAGAGGAAGG AGAAAAGCTA GAGAGGGAAG TCGCATCAAT TGCTAACAAG
TATGGGATAA GGTTTCTTGG GCCAAATACC ATGGGTCTTC TAACTCCTGA TTATAACGGT
ACCTTCGCCT TCGCAGACGT TAAACGTGGA GAAATTGCCC TCGTGGTACA GAGCGGTGGA
ATAGGTGCTT ACATGCTTAA CTGGGCTCAG AAATCTAGGA CAGGTGTAAG TTTTCTCGTT
AGTCTAGGAA ACCAGACTGA CGTGAAGGAG TACGAGGTTA TAGATTATTT AGCGAAGGAC
CCTGAAACTA AGGCAATCTT TGTGTACATT GAGGGAGTAT CTGACGGAGA GAAGTTTCTC
AACGTGGTTC CAGAGGCCAC ATCCAGGAAA CCGGTCATCT TCATCAAAGG GGGTGCCTCG
GCCCAGAGCG CAGAGGCCGT AAAGACCCAC ACTGGGAGTC TAGCAGGATC TTATGAGGTT
TTTAAGGCAG CCATAAAGAC AGTGGGAGGG ATATTTGTTG AGAACCTTAA GGATTTCTTA
AACCTATCAA AACTTGTTAA CTCCTCGGAA CCTATCAGGG AAGAGATCCT GGTGGTGACC
AACTCAGGAG GTCACGGTGT ACTGACTTCA GATGCCATAA GTAGGGCTGG ACTATCCCTC
GTAAAAATTC CTGAAAGGCT AAACATGGAA CTCAGAAAGG TCCTTCCACC CCAAAGCATA
CCGAAGAATC CCCTTGACCT ATCCGGAGAT GCAGGAAGAG ACAGGTATCT AAATTCGCTC
AAGATAGTAT CAGACCTAGA TTGCACCAAA CTAGTAATTG TGGAGTCCTT GCCATTTATA
AGCTGTACGG AGGTGGCAAA GGTCCTGCTC AACTTCAAAG GAAAGGGAAT CATAGGCGTA
ACAATGGGAT ATGACGAAGA TTCAGCCTCG AGAATTCTTG AGTCAGCCTC AATCCCCGTT
TTCACGTTCC CGGAGGAGGC CGTGAACGCG ATAAGTAAGC TAGTGAAGAG ACCTTCACCA
AGAAGAAAGG TTAGGGTTAC CCAACCCATT GACAGCGCAC GGGAACTAAG CAAGGGTAAA
AGTTTCTTGG CAGATTATGA GGGGCTTAAG CTCATGGAAC TCTATGGAAT AAGGACGCCA
AGATGGGGGA TTGCTAACAC GCTGGAGGAG GCGCAGAGGC TAGCGGATTC TATTGGTTAC
CCTGTGGTCA TGAAAATCTC CACAGATCAG CCGGTACACA AGACCGAGCT CAAGGGGGTT
TACATGAACG TGGAAAGAGA CATGGTTAAG GAGAAGTTTG ATCTTCTCTC CAAGATTTCC
AAGAGGGTAA TGATTCAGGA ACAGCTAACA GGTCTAGAGG CCTATGTGGG TGGAATCAGG
GATCCGGTGT TTGGTCACAC AGTCCTGATC GGCGTGGGAG GAATATATGT AGAGGTCCTG
AAAAGTGTTA GCTATGGTAT CGCCCCAGTG TACGAGGATG AAGCTCTGGA AATGTTAAGG
GAAAGCAAGC TCCTGGACAT GATTAGGGCA AGAAAGAGGG GCTATGACGA GGGTTCTGTG
ATAAGAACGG TCTCCAATAT ATCGAGGCTC ATCTTGGATC TAAATGTGAA GGAGATGGAC
ATAAATCCTC TAATGGTCAA TGAGAAGGGA GCCTTTGCGG TTGACGTTAG GGTAACGTTT
TAG
 
Protein sequence
MSLDYLFRPR TIAVVGASRN REKVGNVIFR NVASTFRGKV YPVNNKSETI EGVQSYKSLK 
DIQDDVDLAI ISVPRESVPG VMEEAGEKGV RSAIVITSGF REVGEEGEKL EREVASIANK
YGIRFLGPNT MGLLTPDYNG TFAFADVKRG EIALVVQSGG IGAYMLNWAQ KSRTGVSFLV
SLGNQTDVKE YEVIDYLAKD PETKAIFVYI EGVSDGEKFL NVVPEATSRK PVIFIKGGAS
AQSAEAVKTH TGSLAGSYEV FKAAIKTVGG IFVENLKDFL NLSKLVNSSE PIREEILVVT
NSGGHGVLTS DAISRAGLSL VKIPERLNME LRKVLPPQSI PKNPLDLSGD AGRDRYLNSL
KIVSDLDCTK LVIVESLPFI SCTEVAKVLL NFKGKGIIGV TMGYDEDSAS RILESASIPV
FTFPEEAVNA ISKLVKRPSP RRKVRVTQPI DSARELSKGK SFLADYEGLK LMELYGIRTP
RWGIANTLEE AQRLADSIGY PVVMKISTDQ PVHKTELKGV YMNVERDMVK EKFDLLSKIS
KRVMIQEQLT GLEAYVGGIR DPVFGHTVLI GVGGIYVEVL KSVSYGIAPV YEDEALEMLR
ESKLLDMIRA RKRGYDEGSV IRTVSNISRL ILDLNVKEMD INPLMVNEKG AFAVDVRVTF