Gene Msed_0394 details


Gene Information

Locus tag: Msed_0394
Symbol:
ID: 5103637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 341742
End bp: 343391
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 53%
IMG OID: 640506300
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001190495
Protein GI: 146303179
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
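
The Gene Length and Protein Length values above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly. The function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(341742, 343391)
print(length, protein_length(length))  # 1650 549
```

Under these assumptions, both computed values agree with the record.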


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGAT TCAAAATACC CAACTACGAG GGGGTAGATC CAACAGGTTC TTGGTACTCG 
GTTCTAACTC CTCTCCTCTT CCTTGAGAGA GCTGGCAAAT ACTTCAAGGA CAAGACGGCA
GTTGTTTACA GGGACAGTAG ATACACCTAT TCCACCTTCT ACGACAACGT CATGGTTCAG
GCCAGTGCCC TGATGAGGAG AGGGTTCTCG AGGGAAGACA AACTTTCCTT CATTTCACGG
AACAGGCCTG AGTTCCTTGA GTCGTTCTTT GGGGTTCCAT ACGCGGGTGG AGTTCTAGTC
CCCATCAATT TCAGGCTCTC ACCTAAGGAG ATGGCCTACA TTATCAACCA TTCCGACTCC
AAGTTCGTTG TGGTGGACGA GCCGTACTTG AACTCACTGC TGGAGGTAAA GGACCAGATT
AAGGCTGAGA TCATCCTCCT GGAGGACCCA GACAATCCCA GTGCCAGCGA GACAGCCCGG
AAAGAGGTGA GGATGACCTA TCGCGAGCTA GTGAAGGGAG GGTCGAGGGA TCCTCTCCCT
ATCCCAGCGA AGGAGGAATA CTCCATGATA ACGCTTTACT ACACCTCAGG GACTACGGGT
CTGCCCAAGG GGGTGATGCA TCATCACAGG GGTGCCTTCC TTAACGCCAT GGCCGAGGTA
CTGGAACACC AGATGGACCT CAACTCGGTT TACCTCTGGA CCCTCCCAAT GTTCCATGCT
GCGTCCTGGG GTTTCTCTTG GGCCACAGTT GCTGTGGGAG CAACCAACGT GTGCCTGGAC
AAGGTAGATT ACCCCTTGAT ATACAGGCTC GTGGAGAAGG AGAGGGTCAC TCACATGTGC
GCTGCCCCAA CGGTTTATGT GAACCTAGCA GATTACATGA AACGTAACAA CCTTAAGTTC
AGTAACAGGG TCCACATGTT AGTGGCAGGC GCAGCTCCCG CCCCGGCTAC CTTGAAGGCA
ATGCAGGAGA TTGGCGGTTA CATGTGTCAC GTGTATGGGC TTACCGAGAC CTACGGGCCC
CACTCCATCT GTGAGTGGAG AAGGGAATGG GACTCGCTTC CCCTGGAGGA ACAGGCCAAG
CTCAAGGCAA GACAGGGCAT ACCATATGTA AGCTTTGAGA TGGACGTGTT TGATGCTAAC
GGCAAACCTG TTCCATGGGA TGGGAAGACC ATTGGCGAGG TAGTAATGAG GGGTCATAAC
GTAGCTCTTG GGTATTATAA GAACCCCGAG AAGACCGCGG AGTCTTTCAG GGATGGGTGG
TTCCACTCAG GAGACGCTGC CGTGGTTCAC CCAGACGGTT ATATCGAGAT AGTGGATAGG
TTCAAGGACC TGATCAACAC AGGAGGCGAG AAGGTTTCCA GCATTCTCGT GGAGAAAACT
CTCATGGAGA TCCCTGGCGT GAAGGCAGTA GCCGTGTATG GTACTCCAGA CGAGAAATGG
GGAGAAGTGG TAACTGCGAG GATAGAATTA CAGGAAGGAG TCAAGCTCAC GGAGGAGGAG
GTGATAAAGT TCTGCAAGGA GAGATTAGCT CACTTCGAGT GTCCCAAGAT TGTGGAGTTC
GGTCCCATAC CCATGACCGC CACGGGTAAG ATGCAGAAGT ACGTGCTCAG GAACGAGGCT
AAGGCCAAGG CAAACAAGGA GAAGTCTTAG
 
Protein sequence
MGGFKIPNYE GVDPTGSWYS VLTPLLFLER AGKYFKDKTA VVYRDSRYTY STFYDNVMVQ 
ASALMRRGFS REDKLSFISR NRPEFLESFF GVPYAGGVLV PINFRLSPKE MAYIINHSDS
KFVVVDEPYL NSLLEVKDQI KAEIILLEDP DNPSASETAR KEVRMTYREL VKGGSRDPLP
IPAKEEYSMI TLYYTSGTTG LPKGVMHHHR GAFLNAMAEV LEHQMDLNSV YLWTLPMFHA
ASWGFSWATV AVGATNVCLD KVDYPLIYRL VEKERVTHMC AAPTVYVNLA DYMKRNNLKF
SNRVHMLVAG AAPAPATLKA MQEIGGYMCH VYGLTETYGP HSICEWRREW DSLPLEEQAK
LKARQGIPYV SFEMDVFDAN GKPVPWDGKT IGEVVMRGHN VALGYYKNPE KTAESFRDGW
FHSGDAAVVH PDGYIEIVDR FKDLINTGGE KVSSILVEKT LMEIPGVKAV AVYGTPDEKW
GEVVTARIEL QEGVKLTEEE VIKFCKERLA HFECPKIVEF GPIPMTATGK MQKYVLRNEA
KAKANKEKS
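
Both sequences are displayed in blocks of ten characters, so whitespace has to be stripped before they can be compared. The following is a minimal sketch of how the gene sequence could be translated and checked against the protein sequence. It relies on the fact that translation table 11 uses the same codon assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G,
# with the third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence above.
print(translate("ATGGGCGGAT TCAAAATACC CAACTACGAG"))  # MGGFKIPNYE
print(translate("AAGGCCAAGG CAAACAAGGA GAAGTCTTAG"))  # KAKANKEKS
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would give a complete consistency check; `gc_fraction` can be compared with the GC content field in the same way.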