Gene Msed_1015 details


Gene Information

Locus tag: Msed_1015
Symbol:
ID: 5104318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 935268
End bp: 936764
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 55%
IMG OID: 640506914
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001191107
Protein GI: 146303791
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
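
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon, and the function names are hypothetical rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(935268, 936764)
print(length_bp)                  # 1497
print(protein_length(length_bp))  # 498
```

Both values agree with the record: 1497 bp and 498 aa.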


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00519801
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATTAGAG GCCCAATTTT ACCCGACCTC AGACCTAGAA CCCAGAGCGA TATCCTAGAG 
TCCTCTGGGG AAGGCGTGGC CATAAACTTC CTTGGCAACA GGATAAGTTA CCCCGAGCTG
AGGGGAATGG TGGAGAGCGT TTCCTCCCAA CTGGAAATTG GCCGGGGAGA CGTGGTAATC
CTCTCCACCC AGAACATACC GCAGTTCGTC ATTGCCGAGT ACGCGGTGTG GAGGAAGGGA
GGGATTGTGT TGCCCGTTAA TCCCAGTTAC ACGCAAGCTG AACTGGACTA CTTGGCAAGG
GACTCTGGGG CGAAGCTCGT GATCGCGTCA TGTGAGTCCA ACGTTCCCTC GAACTTGCCT
GTGATCAGGA CCAATCCTCA CACCTTTCAC AAGGTGGAAG GGTGGAATAT CCCGGACTGC
GAAGAGGAAC TCAACCTCAA GTCGGGGAGA GGGGACAGGG TGAACTACTC CCCCCAAGAG
GTCGCGGTCC TGATGTACAC CTCGGGAACA ACGGGGAAGC CCAAGGGAGT TCCGATTACG
CACTCCAACC TTTACGCCTC CTCTCTCATC TACGTGAGGT GGTTCCAGTT CACGGGACGG
GACAAGGTCC TTGGGATCGC ACCATTCTTC CACGTGACTG GGCAGGTCTT CCACGTGACC
ACGCCCGTGA TGGCTGGGTC GCAGATAGTA GCAACTTTCA GGTTCGATCC CAGGTCAGCA
CTTAGGACAG TTCAAGAGGA GAGGACCACG GTAACCATGA GCGTTGCCAC AGCGTATAGG
GCCATGCTCA ACTCCTACTC TGGGGAAGAC CTAACGTCGA TGAGGTTATG GTCCTCTGGC
GGGATGCCAA TGCCTCGAGC CCTAGAGGAG GAGTGGAAAA GGTTGACAGG TTCCTGGATC
TATATGGCCT GGGGCCTCAC GGAGACCACA TCACCGGCCA CGCTGTGGCC TTACCCCTAC
TCAGGCGAGC TACCGGTTAA CGAAATGGGT GTAGTGAGCT CTGGGATGCC CGTGTACAAC
ACAGAGATCG AGTTGGAGGA CGGCGAGCTC CTGGTGAGGG GTCCTCAAGT CGTGAAGGGT
TACTGGAAAC AGGAGGAGTT CAAGGACGGA TGGCTTCACA CAGGGGACAT TGGCGAGATA
AGAGATGGTT GGGTTTACGT AATAGACAGG AAGAAGGATG TCATAGTTAC CTCGGGCTTC
AAGGTAATGC CGAGAGAGGT GGAGGAAGTT CTTCATCTTC ACCCTGGGGT TGACGAGGCA
GTTGTCGTGG GTATACCGGA CGAGTACAGG GGCGAGCGGG TAGTAGCCTT CGTGAAACCT
AGACCGGGAG CCAAACTGAA CCTCGAGGAA CTTAAAGAGT TCTGTAGGAC AAGGCTAGCC
CCATACAAGG TCCCCAGAGA GATCAGACTT GTGGACGAGA TTCCGAAGAC AGGTTCAGGC
AAGATTATGA GGAGAGCCTT CAAGGAGGAG AGGTCACCAA GTCATAGTAA CAGTTAA
 
Protein sequence
MIRGPILPDL RPRTQSDILE SSGEGVAINF LGNRISYPEL RGMVESVSSQ LEIGRGDVVI 
LSTQNIPQFV IAEYAVWRKG GIVLPVNPSY TQAELDYLAR DSGAKLVIAS CESNVPSNLP
VIRTNPHTFH KVEGWNIPDC EEELNLKSGR GDRVNYSPQE VAVLMYTSGT TGKPKGVPIT
HSNLYASSLI YVRWFQFTGR DKVLGIAPFF HVTGQVFHVT TPVMAGSQIV ATFRFDPRSA
LRTVQEERTT VTMSVATAYR AMLNSYSGED LTSMRLWSSG GMPMPRALEE EWKRLTGSWI
YMAWGLTETT SPATLWPYPY SGELPVNEMG VVSSGMPVYN TEIELEDGEL LVRGPQVVKG
YWKQEEFKDG WLHTGDIGEI RDGWVYVIDR KKDVIVTSGF KVMPREVEEV LHLHPGVDEA
VVVGIPDEYR GERVVAFVKP RPGAKLNLEE LKEFCRTRLA PYKVPREIRL VDEIPKTGSG
KIMRRAFKEE RSPSHSNS
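
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips that grouping, translates the DNA and computes a GC fraction. The helper names are hypothetical. The codon table is built from the standard NCBI amino-acid ordering; translation table 11 assigns the same amino acids to each codon as the standard code and differs only in the permitted start codons, which this sketch does not treat specially.

```python
from itertools import product

# Amino acids in NCBI order, with bases cycled as T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGATTAGAG GCCCAATTTT ACCCGACCTC AGACCTAGAA CCCAGAGCGA TATCCTAGAG"
last_line = "AAGATTATGA GGAGAGCCTT CAAGGAGGAG AGGTCACCAA GTCATAGTAA CAGTTAA"

print(translate(first_line))  # MIRGPILPDLRPRTQSDILE
print(translate(last_line))   # KIMRRAFKEERSPSHSNS
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the whole gene sequence to `gc_fraction` would give a value to compare with the GC content field.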