Gene Msed_0401 details

Gene Information

Locus tag: Msed_0401
Symbol:
ID: 5105518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 351598
End bp: 353205
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 50%
IMG OID: 640506307
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001190502
Protein GI: 146303186
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
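
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon that is not translated. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 351598
end_bp = 353205

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1608 535
```

Both results match the reported Gene Length (1608 bp) and Protein Length (535 aa).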


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGGATATT ATAACTACAA TTTAACGGTG GATAAGCTCC TCGACTACGG GTCCACCGTG 
TACTCAAGCA AGGAGATCGT GTATAGGGAT AGGAGGCGTT ACACATTTTC CAAGTTTAGG
GAGAGAGTAG AAGCCTTCGC TCAATCGCTT CTTAAGATTG GCGTGAAGCC AGGTGACAAG
GTGGCTGTAG TTGACTGGGA CACAGACGTT TACATGACAG CATATTACGC CGTGCCCATG
ATAGGGGCTG TCCTGCACAC AGTTAACGTT AGATATCCCC CTGAGGTCAT GCTTAAGACG
GTGCTTCACG CTGAGGATAA ATGGGCCATT GTCAGGGATG AGTTTACCCC GCTCCTCGAG
AAGGGAAAGG CCTTTCTTGG CGGGCTGAAG GGAGTGATCA CATATTCAGA TGAGAGAGAG
AAGGTTAAGA CTAGTTTCCC TACCCACGAT TTCTGGGAGC TCCAGGAGAG CGGTGAAAGG
TTGCCTCCCC AGGAGATCAA GGAGGACATG CAGGCGACCG TTTTCTATAC CTCTGGTACC
ACAGGTGAGC CGAAGGGTAT ATGGTTCACG CACAGGGACC TGGTTCTTCA TTCCATGAGC
GTGGCGCTCA CCACATCTAA GCCTCCACTT AGGTCGTCAC AGGACGACAA CTTCATGATT
CTCGTTCCCA TGTTCCACGT TCACCAGTGG GGGTTCCCCT ACGTCACGAT GCTCGTGGGG
GCAAATTATG TTCTGCCTGG TAGGTATGAC CCAGCGATGG AGATAGAACT CATGAGAAAG
GAACACGTTA CATTCTCCGC CATGGTTCCC ACAATCCTTT ACATGATTCT ATCTCACCCC
AACGCGTCCA AATATCCAGA GGTGTTTAAG GGTTGGAAGG TTATGATAGG TGGGTCGGCT
TTGCCCACTG AACTGGCATA TGCAGCTAGG AAGATGGGGA TAAACATTGC CGTAGGATAT
GGGATGTCCG AGACAGCACC TGTGCTAACT GTGGCCTACT ACACACCCGA GGTGGAGAAG
TTGCCAGAGG AGGATAGGTT CCTCCAACAG ATAAAGACAG GAGTACCCAT ACCCCTGTCT
CAGGTAAGGG TAGTTGATGA GAAGGGAAAC GACGTTCCTA GGGACGAGAA GACTGTGGGC
GAGATAGTTG CCAGGGCGCC TTGGCTAACG AGGTCGTATT ATAAGGATCC CGAGAGGACC
GAAAAGCTCT GGAAGGACTC ATGGCTTCAT ACTGGAGACC TGGCGGTTGT GGACAAGTAT
GGATACATCA GGATCGTGGA CAGGGATAAG GACGCCATAA AGAGCGGAGG CGAGTTCATC
CCCTCGCTTA TACTTGAGGA CATCATCTCC ACCCATCCTA AGGTAGGAGA GGTAGCCATT
GTGGGGATGA AGGACGAGAA GTGGGGAGAG AGACCTGTGG CATTCATTGT TCCCAAGGGA
GATCTAAAGG AAGAGGAGAT AAGACAGTTC CTATTGACTA AGGTCGAGGA AGGGAAGTTG
CAGAAGTGGT GGATTCCAGA CAGGTTCGTC TTCGTAAAGG AATTCCCGAA GACTTCCACT
AATAAGATAG ATAAAAAGGC TCTACGTAAT CAACTAAGCT CTGGTTAA
 
Protein sequence
MGYYNYNLTV DKLLDYGSTV YSSKEIVYRD RRRYTFSKFR ERVEAFAQSL LKIGVKPGDK 
VAVVDWDTDV YMTAYYAVPM IGAVLHTVNV RYPPEVMLKT VLHAEDKWAI VRDEFTPLLE
KGKAFLGGLK GVITYSDERE KVKTSFPTHD FWELQESGER LPPQEIKEDM QATVFYTSGT
TGEPKGIWFT HRDLVLHSMS VALTTSKPPL RSSQDDNFMI LVPMFHVHQW GFPYVTMLVG
ANYVLPGRYD PAMEIELMRK EHVTFSAMVP TILYMILSHP NASKYPEVFK GWKVMIGGSA
LPTELAYAAR KMGINIAVGY GMSETAPVLT VAYYTPEVEK LPEEDRFLQQ IKTGVPIPLS
QVRVVDEKGN DVPRDEKTVG EIVARAPWLT RSYYKDPERT EKLWKDSWLH TGDLAVVDKY
GYIRIVDRDK DAIKSGGEFI PSLILEDIIS THPKVGEVAI VGMKDEKWGE RPVAFIVPKG
DLKEEEIRQF LLTKVEEGKL QKWWIPDRFV FVKEFPKTST NKIDKKALRN QLSSG
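
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating a few codons by hand or with a short script. The following is a minimal sketch, not the database's own method. It builds the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits), and translates the first 30 bases and the last printed line of the gene sequence. The function name `translate` and the variable names are hypothetical; the last line is assumed to begin on a codon boundary.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence, copied from the record.
first_block = "ATGGGATATT ATAACTACAA TTTAACGGTG"
last_block = "AATAAGATAG ATAAAAAGGC TCTACGTAAT CAACTAAGCT CTGGTTAA"

print(translate(first_block))  # prints: MGYYNYNLTV
print(translate(last_block))   # prints: NKIDKKALRNQLSSG
```

The two outputs agree with the first ten and last fifteen residues of the protein sequence listed above.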