Gene Msed_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2032 
Symbol 
ID5105254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1958264 
End bp1960021 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content46% 
IMG OID640507920 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_001192096 
Protein GI146304780 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.329827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTAA AGGAGTTTGT TGTTTCGTTG TTCCAATTGG ACAAGGACTG GACTACTAGG 
ATTGTCATGG CTATGCTAGT CATGGGAGTG ATTTGGGGCC TATTGGGAGT GATAGACTCC
CTTATGGTGA GGATACAAGA GACCGCGTGG GGGCTGAGTG GAACCCTAGT GTTCACTCCG
CAGGAGTACT TCGCCAGCAT CACGTTGCAT GCTGAGAGGG ACCTCTTCGG ATTCGCTCAG
CAGGTGATAT ATGCCATTTT CATTTATTTC ACAATTAAGC TTCTTAACCT GCAACCTAGG
GCTAAGTGGT TGCTCAACAT CTCCTTCATC CTGATTAACA TTTCAATGAT GTTTATGGAA
GGGCCGATTG TCGTTGCTCC CACTTTTAAT GACAACTACT TCAGTGCTAC CGACTGGTAC
TACATATCTC CCATGGGAAT TCCCAACTAC TCCAATTACG TGGTATCTCC GCTATTCTTT
TACGGTTGGC TTCTACTGGA CGCCTTCACC TATCTAGCGG GAATATGGAT CGTGTACCAT
TATTATATAG CGTCCAAGCA ACTTAAGGAG AAGCTACCTG TTCCTCTGGT ATTTTTCCTC
ATGAATACCT TGCTTTTCAT GATAGGCTAC TCTGGGGTAA CCGCAGCTGA TGTATGGGAC
ATACTAGCGT TCTATAACCT GGTGCCCCTC AACGCAATTG CGAACCAAAT AGCCTTCTGG
ATCTTTGGGC ATGCTATCGT GTATATGGCG TGGATGCCTG CAGTTGGAGC ACTTTATCTT
CTTATTCCAA CACTAGCTAA CAAACCGCTT TACAGCGATA GAATGGGTAG AATTTCTGCT
CTGCTCTACT TAATATTTTC CAACAATGTA CCCATTCATC ACCTATACAT GGTTAACCTT
CCAGTCTCTA TCAAGATACT CCAGGAAGTT CTTACGTATG CAGTTGTGGT CCCCTCAATG
TTGACCTTCT TCAATCTATG GGCTACGGTA AAGGGCGCAC AGGTTAAGTT CAACGTGATA
ACTGCTTTCA CTGTGACATC CTTCTCTGGG GCCATAGCTG CAGGAGTTAC CGGTATATCT
AACGCAACCA TAGCCTTCGA CGCAATAATT CACAACACCG ATTGGGTAGT GTCCCACTTC
CATGCAATGA TACTGCTGTC CATAGTTCCA GCAGCAATGG CAGTGCTTTA CTTCATGATA
CCCATGATGA CTGGGAAGCA GTGGTTCTCA TCCAAGATGG CGTGGGTTCA TTGGATTGGT
TACGTGTTTG GATCCATTCT CTTCATAGTG GGCTATGAAT TGCAGGGATT CGAGGGCTTG
GTGAGAAGGG CTGAGATCTA TCCTAGAGTA CCGACTCTAA TTACGGCAGA GGTTATCTCC
ACTGTGGGTG CAGTAATAGC AGAGCTTGCT ACCCTAGTTT GGTTCCTGAA CCTGGTCCTT
ACCCTAGTTA AGGGTAGAAA TATGAACCTA GAGGGAGTAG GGCTTGGCCA GCTCATTGGC
ACCGTGGGAG CTGCACTAGA GTGGAATGGA GAAAACATTA ATATTCCAAG TTTATTCAGT
AAAAACATGA TAAAGAAAGG ATTAAGTGGC CTGTGGACTC TGGGAATATT GGGTGCACTT
GTGATAGTGA TTAGCATGTT CCCACTGGCC TTCTCTGGTA ACACTTACAA CGCAATGCCA
TGGATCTGGA TAGTCCTGCT GTCCATAGGG ATAGTGCTGA TCTCGTATCC CGTATTAAAG
GGGGCTAAAA GTTTATGA
 
Protein sequence
MGLKEFVVSL FQLDKDWTTR IVMAMLVMGV IWGLLGVIDS LMVRIQETAW GLSGTLVFTP 
QEYFASITLH AERDLFGFAQ QVIYAIFIYF TIKLLNLQPR AKWLLNISFI LINISMMFME
GPIVVAPTFN DNYFSATDWY YISPMGIPNY SNYVVSPLFF YGWLLLDAFT YLAGIWIVYH
YYIASKQLKE KLPVPLVFFL MNTLLFMIGY SGVTAADVWD ILAFYNLVPL NAIANQIAFW
IFGHAIVYMA WMPAVGALYL LIPTLANKPL YSDRMGRISA LLYLIFSNNV PIHHLYMVNL
PVSIKILQEV LTYAVVVPSM LTFFNLWATV KGAQVKFNVI TAFTVTSFSG AIAAGVTGIS
NATIAFDAII HNTDWVVSHF HAMILLSIVP AAMAVLYFMI PMMTGKQWFS SKMAWVHWIG
YVFGSILFIV GYELQGFEGL VRRAEIYPRV PTLITAEVIS TVGAVIAELA TLVWFLNLVL
TLVKGRNMNL EGVGLGQLIG TVGAALEWNG ENINIPSLFS KNMIKKGLSG LWTLGILGAL
VIVISMFPLA FSGNTYNAMP WIWIVLLSIG IVLISYPVLK GAKSL