Gene Msed_1755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1755 
Symbol 
ID5104755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1691572 
End bp1692732 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content49% 
IMG OID640507650 
ProductDNA topoisomerase VI subunit A 
Protein accessionYP_001191834 
Protein GI146304518 
COG category[L] Replication, recombination and repair 
COG ID[COG1697] DNA topoisomerase VI, subunit A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0808112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.29442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGT TCGTATCTAA GGTTGACAAG GAGGCAAGGA GTAGGGCTGC CGAGACTTTA 
AAGAAGAACT TCCTCACACT CTTGGATCAG CTAAATAGGG GAGAACCCCT GGTCATGGAA
ATCCCCAAGC GTACCCTGTC TAACACCGTT TACGATGAAA AGAGGAAACT CCTCCTCTTG
GGAGAGGAGA AGATGAAGAG AAGTTTCCTA GATATGAACG AGGCCAGGAG GTTCATGCAG
ACCGTGCTCA TGGGAAGCAT AATTTACGAT GCTCTCGTTA ACGACGAGTA CCCCACCATA
CGTGACTTGT ATTACAGGGG AAAGCACTCC ATAATTCTCA GGGATCCCAA GGGAAAGACC
CACGAGGAGA ACACCTGGGA CGAGCAAAAG GAATCTGACA GCGTCATAAT AGACATTGAG
GTCTTCACCT CGCTCCTTCG TGAGGAGATG CTGATCCTGA GCAAGGAGAA GGGGAAAGTG
GTTGGGGATA TGAGGTTAAG GAGCGGTAAC GACATAATAG ACCTAAGCAA GATGGGCCAC
GGGGCTTACT CCATTGAGCC TACCCCAGAC CTGATAGATT TCGTTGAGGT TAACGCTGAT
TACGTCCTTG TGGTGGAGAA GGATGCTGTA TTTCAGCAAC TGCATAGGGC AGGCTTCTGG
AGGAAGTATA AGTGTATTCT GATCACCAGT GCAGGACAGC CCGATAGGGC GACCAGGAGA
TTCCTGAGAA GGCTAAACGA GGAATTGAAG CTACCCGTTT ACATTTTGAC AGACGCTGAT
CCATATGGAT GGTATATTTA CAGCGTTTTC AGGATAGGCT CGATCTCTCT CTCCTACGAG
AGCGAGAGGC TCGCCACTCC AGATGCCAAG TTCTTGGGCG TCTCCATGGG CGATATCTTC
GGAACTCCCA AGAAGAAGGC CTACCTCAGC GAGAGGGAGA GGTCGAGTTT CATAATAAAG
GCAAAGGAGA CCGACATAAA GAGGGCACTG GAGATCAAGA ACTACAGCTG GTTTAAGACC
AAGAGCTGGG AAGAGGAGAT AAACATGTTC CTGCAGAAGA AGGCCAAGCT AGAGATAGAG
GCCATGGCAA GCAAGGGTCT CAAGTTCCTA GCCTTCCAGT ACATCCCTGA AAAAATTCAG
ACCCAGGACT TCATAGGGTA G
 
Protein sequence
MSEFVSKVDK EARSRAAETL KKNFLTLLDQ LNRGEPLVME IPKRTLSNTV YDEKRKLLLL 
GEEKMKRSFL DMNEARRFMQ TVLMGSIIYD ALVNDEYPTI RDLYYRGKHS IILRDPKGKT
HEENTWDEQK ESDSVIIDIE VFTSLLREEM LILSKEKGKV VGDMRLRSGN DIIDLSKMGH
GAYSIEPTPD LIDFVEVNAD YVLVVEKDAV FQQLHRAGFW RKYKCILITS AGQPDRATRR
FLRRLNEELK LPVYILTDAD PYGWYIYSVF RIGSISLSYE SERLATPDAK FLGVSMGDIF
GTPKKKAYLS ERERSSFIIK AKETDIKRAL EIKNYSWFKT KSWEEEINMF LQKKAKLEIE
AMASKGLKFL AFQYIPEKIQ TQDFIG