Gene Information |
Locus tag | Msed_2164 |
Symbol | |
ID | 5104903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2080228 |
End bp | 2081976 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640508057 |
Product | hypothetical protein |
Protein accession | YP_001192227 |
Protein GI | 146304911 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
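The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative only.
start_bp = 2080228
end_bp = 2081976
gene_length_bp = 1749
protein_length_aa = 582

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the last codon is the stop.
codons = span // 3
residues = codons - 1

print("span:", span, "bp")
print("remainder mod 3:", span % 3)
print("residues (excluding stop):", residues)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.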
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000710411 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAATTTG AACCAATAGT GGGAAGGCTC AGGGAGGTCA GAGTAACCAC AAGGCCCACG GCAGATGGAA GAAGTTCCGT AACGTTCAGG AATTTCGTAG TGGAGATGCC ATACTCCAAG GACCTTAACC TCAACGTGGG AGCCCTTCTC GCGGTGGAGA CCATAAGGAG TAATACCTAC CTTATCCTTG AGGTCGCTGA CTACGTTCCC GTCCATTATG GAATGATAAA CATGGATGGA TCGATACCCA AGGAGATAAG GGATCAGGTA ATGGAGGAGG TCTCCAAGAG TTGGAAAGAC GGGAATAGCA CGGAGATGTG GATAGATGTG TTTGCGTATC CCATAGGATA CATCGCAACC CCTGAAGGGT TCAAAAAGGG ATATCTTCCA CCTCTTCCAG GCTCTCCAGT TAGGATTCTA AGCAGGGAGT CATTCAGGGA GTTCGTTTGC GCAAGGAATG GAGTGGAGAT CGGGAAAGTA ATAGGGGAAG ATATCCCCCT GACCGTGGAT CTTTCGAAGG CTATGGTTTA CCACATGGGA GTCTTCGCCT TTACTGGGTC GGGCAAGTCT AACCTCACCG CCTCCATAAT AAGGAGAATC CTCAATAACA CAAACGCCAA GGTTGTGATC TTCGACGTTT CCATGGAGTA TTCAATCCTT TTGTTGGACC AGCTTCTGGC CCAGAGGGCC GAGATCCTTA CCACGGATAG GTACTCACCC AATCCCTTAG ATGCTAGCAG GAAGTTCATG AGAACCCACG TAATACCAGA GGAGTTAGAG AAGTTCAGGG AGAACATTAG GAGGAGAGTT GAGGAACTGT TCACGTCGGG AAAGATAAGA ACCCTCTATA TCCCTCCAGA GGGCTCAATG GGCTTAACCT TCGAAACCCT CCTGGAGCTG GTGAAGGATC AGATAGATGA CAAGTACACG GCCTTCGCTC AGAAGCCCCT GTTTAGCCTC ATGCTCAGGA AGCTTGACAC CTTCATGAGG CAGAACAGAA TTTCCAAGGA CGCTCCTCTA GACGACTCAA TCCTTTCAAT ACTAGATGAA ATGGAAAATG AGGGTAGAAA CGCTGGTCTG AAGGAGAACT CATCGCTCTT CTCATTCATA TCATCCTTAA GATCATACAT AAACACAGAG GTCGAGGAAA GTGAGGAGTA CGACGTCGAG AAGCTAGCAA TAGACATTCT AGATAAAGAT GAGTCCTCTC CCAGACTTTT CATCCTAGAG CTACAGAATC TGGAGGAATC CAGGGAAGTT GTGGCGTCTC TGCTCGAGGA GGTTATGTCA AGGAGGAAAA GGTCCTTCAG TACTTCCCCA ATTCTCTTCG TTTTGGACGA AGCTCAGGAG TTCATTCCAT TTGATACTAG GCAGAGAGAC AAGAGCGAGC TGTCCAGTAA CGCCGTGGAG AAATTGCTGA GACATGGAAG GAAGTATCAC CTTCACGCCC TGATCAGCAC CCAGAGGTTG GCTTACCTAA ACACGAACGT TCTTCAGCAA CTCCACACTT ACTTCATCAG TGTCTTACCC AGGCCCTACG ATAGGCAGTT GGTCTCTGAA ACCTTTGGGA TAAACGATAC CCTCCTCGAT AGAACCCTTG ATCTTGAGGT TGGACAATGG CTCCTTGTGA GCTTCAAGGC ATCCTTACCG CACGATGTTC CAGTTTTCTT TACAGCACCC AACAACCTAG AGGAGGTGAG AAGGGCGCTT GAGGAGAATA GACCAGCTAA TCCTGTCAAT GGTAAGTGA
|
Protein sequence | MEFEPIVGRL REVRVTTRPT ADGRSSVTFR NFVVEMPYSK DLNLNVGALL AVETIRSNTY LILEVADYVP VHYGMINMDG SIPKEIRDQV MEEVSKSWKD GNSTEMWIDV FAYPIGYIAT PEGFKKGYLP PLPGSPVRIL SRESFREFVC ARNGVEIGKV IGEDIPLTVD LSKAMVYHMG VFAFTGSGKS NLTASIIRRI LNNTNAKVVI FDVSMEYSIL LLDQLLAQRA EILTTDRYSP NPLDASRKFM RTHVIPEELE KFRENIRRRV EELFTSGKIR TLYIPPEGSM GLTFETLLEL VKDQIDDKYT AFAQKPLFSL MLRKLDTFMR QNRISKDAPL DDSILSILDE MENEGRNAGL KENSSLFSFI SSLRSYINTE VEESEEYDVE KLAIDILDKD ESSPRLFILE LQNLEESREV VASLLEEVMS RRKRSFSTSP ILFVLDEAQE FIPFDTRQRD KSELSSNAVE KLLRHGRKYH LHALISTQRL AYLNTNVLQQ LHTYFISVLP RPYDRQLVSE TFGINDTLLD RTLDLEVGQW LLVSFKASLP HDVPVFFTAP NNLEEVRRAL EENRPANPVN GK
|
| |
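The gene sequence above is displayed in space-separated blocks of ten bases. The following is a minimal sketch of how such a field could be joined, summarized, and translated. It uses only the first three blocks and the final nine bases of the gene sequence as sample input; the helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model.

```python
# Codon table built from the standard TCAG ordering; table 11 uses the same
# codon-to-amino-acid assignments (alternative start codons are NOT handled here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove the whitespace between ten-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Sample input copied from the record: first three blocks and last nine bases.
first_blocks = join_blocks("ATGGAATTTG AACCAATAGT GGGAAGGCTC")
last_codons = join_blocks("GGTAAGTGA")

print(translate(first_blocks))   # MEFEPIVGRL
print(translate(last_codons))    # GK
print(round(gc_fraction(first_blocks), 3))
```

The translated fragments match the start and end of the protein sequence field, which indicates that the gene sequence is shown in coding orientation even though the gene lies on the minus strand. Running `gc_fraction` on the full joined gene sequence would be the way to compare against the listed GC content.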