Gene Msed_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1456 
Symbol 
ID5104826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1424552 
End bp1426537 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content51% 
IMG OID640507344 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001191537 
Protein GI146304221 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTATGC GATATATTAT GGTTGAGGAA CAGACCCTGA AGACCGGGTC ACAGGAACTA 
GAGGAGAAGG CAGACTATAA CATGAGATAT TACGCTCACC TCATGAAGTT GAGTAAGGAA
AAACCTGCAG AGTTCTGGGG ATCTCTAGCA CAGGACCTGC TAGACTGGTA TGAGCCTTGG
AAGGAGACCA TGAGACAGGA AGACCCGATG ACAAGGTGGT TCATAGGAGG TAAGATAAAT
GCCTCGTACA ACGCTGTCGA CAGACACCTC AACGGCCCCA GAAAGTTCAA GGCTGCGGTC
ATCTGGGAAA GTGAGTTAGG GGAAAGGAAG ATCGTGACGT ATCAGGACAT GTTCTATGAG
GTTAATAGGT GGGCCAATGC GCTCAGATCC CTAGGAGTTG GTAAAGGGGA TAGGGTGACC
ATATACATGC CCCTGACCCC AGAGGGAATA GCTGCAATGC TGGCCTCGGC CAGGATAGGT
GCAATTCATA GCGTAATATT TGCCGGCTTT GGTTCGCAAG CCATAGCCGA CAGGGTTGAG
GACGCCAAGG CGAAGGTAGT GATCACTGCT GACGCCTATC CCAGAAGGGG AAAGGTTGTG
GAGTTAAAGA AGACTGTCGA CGAGGCCTTA AACTCCCTTG GAGAAAGGAG CCCAGTACAG
CACGTGCTCG TGTATAGGAG GATGAAAACG GATGTAAACA TGAAGGAGGG AAGAGACGTT
TTCTTCGACG AGGTCGGCAA GTACAGGTAC GTGGAGCCTG AAAGGATGGA CTCCAATGAT
CCACTCTTCA TTCTCTACAC CTCTGGGACC ACCGGTAAAC CTAAGGGAAT TATGCACTCT
ACCGGTGGTT ATCTGACCGG GACAGCCGTT ATGCTACTGT GGAGCTACGG CCTTAGCCAG
GAGAACGACG TTCTCTTCAA CACCTCAGAT ATTGGTTGGA TAGTTGGCCA CTCCTACATT
ACCTATTCCC CCCTTATCAT GGGGAGAACG GTTGTCATTT ACGAGAGCGC CCCAGACTAT
CCCTACCCAG ACAAGTGGGC TGAGATTATT GAGAGATACA GGGCAACCAC TTTCGGCACC
TCAGCTACAG CCTTGCGTTA CTTCATGAAG TATGGGGACG AATACGTGAA GAACCACGAT
CTCTCGTCCA TCAGGATAAT TGTGACGAAC GGGGAAGTGC TTAACTACTC TCCGTGGAAG
TGGGGGCTAG AAGTGTTAGG TGGAGGAAAG GTATTCATGT CCCATCAGTG GTGGCAAACT
GAGACAGGCG CACCGAACCT GGGCTACCTT CCGGGTATAA TTTACATGCC AATGAAGTCG
GGTCCAGCCT CAGGCTTCCC TCTACCCGGT AACTTCGTGG AGGTTCTGGA CGAGAACGGA
AATCCCTCTG CCCCTAGAGT GAGAGGATAC CTTGTAATGA GGCCACCCTT CCCGCCTAAC
ATGATGATGG GGATGTGGAA CGATAATGGG GAGAGGTTGA AGAAGACGTA CTTTAGCAAG
TTCGGTTCCC TGTATTATCC AGGAGACTTC GCCATGGTGG ATGAGGATGG ATACATCTGG
GTGTTGGGTA GGGCAGACGA GACTCTAAAA ATTGCAGCCC ACAGAATTGG AGCTGGGGAA
GTGGAATCAG CAATCACTTC TCACCCATCG GTTGCCGAGG CAGCAGTCAT AGGCGTGCCA
GACTCAGTGA AAGGAGAAGA GGTTCACGCG TTCGTTGTGC TAAAGCAAGG TTACGCTCCT
TCCTCTGAAC TGGCTAAGGA CATACAGTCA CACGTTAGGA AGGTCATGGG GCCCATTGTT
AGTCCGCAGA TTCATTTCGT GGATAAGTTG CCTAAGACAA GGTCTGGGAA GGTCATGAGA
AGGGTGATAA AGGCAGTGAT GATGGGTTCG AGTGCTGGCG ACTTAACCAC CATAGAGGAC
GAAGCATCAA TGGACGAAAT AAAGAAGGCT GTCGAGGAAC TAAAGAAGGA GTTAAAGACC
TCCTAG
 
Protein sequence
MFMRYIMVEE QTLKTGSQEL EEKADYNMRY YAHLMKLSKE KPAEFWGSLA QDLLDWYEPW 
KETMRQEDPM TRWFIGGKIN ASYNAVDRHL NGPRKFKAAV IWESELGERK IVTYQDMFYE
VNRWANALRS LGVGKGDRVT IYMPLTPEGI AAMLASARIG AIHSVIFAGF GSQAIADRVE
DAKAKVVITA DAYPRRGKVV ELKKTVDEAL NSLGERSPVQ HVLVYRRMKT DVNMKEGRDV
FFDEVGKYRY VEPERMDSND PLFILYTSGT TGKPKGIMHS TGGYLTGTAV MLLWSYGLSQ
ENDVLFNTSD IGWIVGHSYI TYSPLIMGRT VVIYESAPDY PYPDKWAEII ERYRATTFGT
SATALRYFMK YGDEYVKNHD LSSIRIIVTN GEVLNYSPWK WGLEVLGGGK VFMSHQWWQT
ETGAPNLGYL PGIIYMPMKS GPASGFPLPG NFVEVLDENG NPSAPRVRGY LVMRPPFPPN
MMMGMWNDNG ERLKKTYFSK FGSLYYPGDF AMVDEDGYIW VLGRADETLK IAAHRIGAGE
VESAITSHPS VAEAAVIGVP DSVKGEEVHA FVVLKQGYAP SSELAKDIQS HVRKVMGPIV
SPQIHFVDKL PKTRSGKVMR RVIKAVMMGS SAGDLTTIED EASMDEIKKA VEELKKELKT
S