Gene Msed_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0804 
Symbol 
ID5105127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp733639 
End bp735249 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content49% 
IMG OID640506709 
ProductPyrrolo-quinoline quinone 
Protein accessionYP_001190903 
Protein GI146303587 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGAA TACTCGTAGC CATAGCGTTA GTTTCCGTGT TTTTAGTTGG TTCCTTCCTA 
GGAGCTCCCA TGATACAGTT CTTTTCCTCG GTGATAACGC CAATACCTCA ACCCATTGTA
AGGACTTACG ACATGTATAA CACAACCTAT TTCCCGTATG AGGTGAAAGT CGTGTATTAT
CCAGCTAATG CAACGAGTCA GAATCTGGAT CTACCAAGTT ATTGGGGAGT TACGAACGGT
GGCCAGTCCC ACAATGCAGC CCTTACGACC ACATGCACCG AGTTAATTCA GGGTGTTGTC
TGGCAACAGG ATTTTGCCCA TATGGCTGGT GCAGCCCTCA TTCCCATGAC TGCTCCACAG
AGTATGTTAC CCGGTGCTAG CGTCATGGGA ACAAGGTCCG CACTGGTAAT GTTAACTCAA
ATGGTGGGAG AACCGTTGGG CGTTACATTG GCCGATAATT TACTATTTGT GGAAGAGGAT
AGCGGACCAG GAAGTATCTT CGCAGTGAAT CCTCTGAACG GACAGGTAGT GTGGTATGCC
ACAGGACTAG CCAGTTACGC GATGAATAAC CCCATCGTTT ACAACGGGAT AGTGTATGTG
ACTGTGGGAG ATGTTGGTTT CAACTTCGCG AATTTCGTTC ACTACGAGAA GGGGCAATTC
TCCTCGATTC ACAGGGGGAT GGCATATGGA GCCATCTACG CGTTTAATGC CACTGACGGT
GAGCTGTTAT GGATGAGGTT CACGATGGGA GAGGCAATGC CAGCACCCGC GGTTTATAAC
GGAATCCTTG CCTATTCAGA CGGTGGTGGG GAGTTCATAG GAGTTAATGC GACCACAGGA
CAGGTCCTAT GGCAGGATAT GATGCCAGGC CTGTTTGATA GCATGAGTAG TGTAAACTAC
TACGTTCTGC CTAACGGCAC TCCCTTATTC ATTGCTGGAT TCACAAGTCT GACTGAGCCA
TATGGACTTC TGGTCGCTGT TAATGGAATG ACAGGGAAAG AGGTATGGAA TGCATCTCTT
CCTGCCCCCA ATAAGCCATT CAATACAGGG ATGGGAGATG TGCCCCCAGC TGTGGATCAG
CAGGCAGGTA TCGTTGTGCA GTCAACTGTA GCAAACGCCG AGCCCAATGG AACAGTTGAC
ACCATGGTTC TTGCGGTGAA TGCAACTAAC GGCCACGTTC TGTGGGTGAC AAATCTGGGC
AGAGGTTATA CTCCACCAGC ATTCAAAGGG GCAATCCCAA TGATTTACAA TAACACCGTT
TACGTAGGTT CTCCTTCACT GGGTAAGGAG TTTGCCCTAA ATCTCACTAA TGGCCAAATA
CTGTGGCAGA CCAGGCTTAA CGGGATAGGA TTACCACCAA AGGCTCCTGG TGGACCCAGG
GGTGGAGCAA CCTATTACGA CAACTTGCTG TGGGTAGCTG GAGGTCCTTA CGTTTACGTG
TTGAACCCCC ACAACGGTGA ACTATTGCAA CAGTACTATG TTGGCGGAAG GTTCGGCATA
GTTAACCCCG TGATAGTCGG AAGCACAATG TATCTAACTA ACAGTTACGG CTGGGTGGTG
GCGATCCCAC TCTATCAGAT CTACCCCGAC TACGTACTTT ACGCTAGCTA A
 
Protein sequence
MNRILVAIAL VSVFLVGSFL GAPMIQFFSS VITPIPQPIV RTYDMYNTTY FPYEVKVVYY 
PANATSQNLD LPSYWGVTNG GQSHNAALTT TCTELIQGVV WQQDFAHMAG AALIPMTAPQ
SMLPGASVMG TRSALVMLTQ MVGEPLGVTL ADNLLFVEED SGPGSIFAVN PLNGQVVWYA
TGLASYAMNN PIVYNGIVYV TVGDVGFNFA NFVHYEKGQF SSIHRGMAYG AIYAFNATDG
ELLWMRFTMG EAMPAPAVYN GILAYSDGGG EFIGVNATTG QVLWQDMMPG LFDSMSSVNY
YVLPNGTPLF IAGFTSLTEP YGLLVAVNGM TGKEVWNASL PAPNKPFNTG MGDVPPAVDQ
QAGIVVQSTV ANAEPNGTVD TMVLAVNATN GHVLWVTNLG RGYTPPAFKG AIPMIYNNTV
YVGSPSLGKE FALNLTNGQI LWQTRLNGIG LPPKAPGGPR GGATYYDNLL WVAGGPYVYV
LNPHNGELLQ QYYVGGRFGI VNPVIVGSTM YLTNSYGWVV AIPLYQIYPD YVLYAS