Gene Information |
Locus tag | Msed_1368 |
Symbol | |
ID | 5103427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1337397 |
End bp | 1339268 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507257 |
Product | hypothetical protein |
Protein accession | YP_001191450 |
Protein GI | 146304134 |
COG category | |
COG ID | |
TIGRFAM ID | |
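The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: the start and end positions are 1-based and inclusive, and the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1337397, 1339268)
print(length)                  # 1872
print(protein_length(length))  # 623
```

Under those assumptions the results agree with the Gene Length (1872 bp) and Protein Length (623 aa) fields.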
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGTCCT GGATTCTCCT GATTGTTCTG GTCTTGGGGT TACTCCCAGG TGTAGGGGCT TTTTCTTCAC CAACCATGCC TCACAACTTC ACCCTATACA ACATAACGGC GATGCAGGGT CTAGATCCGA AGTACTACTC CTTCGAGGCT GTAGGGTATC TCCCACCCAA CGAGACTCCT GTCACTGTCA CGGTAGCCAA GAACCTAATA TTCAACGATA CGGCGTGGAA ACCCTACTAC TTGAACGTTC ACATTCCCAA TGGATCCTAC AGCTTCATCC TCATGAACGT TAGCGTGAAG GAAGAGAATG GAACCCAGTT TGATCGCCCC TTCTACATTT TCGTGAACGG CATACCGGTG TTCTGGGGAT CTACACAGGA GATTCAGAAC TCGAGTGCCT CTGTAGACCT AACCATGTTT GAAAACCTCC TACACGGAAA CGTAACCTTT GAACCTGTCC TCGTGAATTT CTACGACGCA AAGGTGAACA TAACAGGGAT CTATCTAGTC AACATAACTC TGTCCCTCTA CCCAGGACAG GCCCCGAGTA ACTTGCCCAA CGAGTTCATA CCTCTCTTCG TGAATGGGAC TTTCAACTAC AACTACTCGT ACGTCATTCT TAACCCTAAT CAGGACACTA TTACTTCTTC GGTTAAGTTA CCTAACGGCA CGTATAGGAT GTCAGCTTTC CTTTACGAGG AAGGCGGAGG ACTAGACGAG TTCTGGTACA GTAACGAGCC GGCCACCAGG GACATCCTGC TCTACTATGA CGGTCTCCTC GGAGGAGTAG TTCCTCCTTA CGAAACAATA TACACTGGTG GTATTGATCT GTTCTGGTGG AAGCCACTTT CCAGCATTAA CACGCTGGCA TTTCACACGC CCTATCAGGT GGATCTTACT CCTCTTCTGG CGCTGGGATC TAATGCTAAC GTCACAGTGA CAGTCTCTAA CTTAGGAACT GCGAAGGAAC TAACGGGTAG TTCCTCCTTC GACTGGGACC TATCAGGGTT CCTGGCTCTG TGGGTAAATC AGAGCAATCC TCTAATCTCT GGGCAGGTAG TGAAGGCCTA TACTAGGTTT ATTGACTCCT CGCCCATCTT TGTTGGCGGT TTCTCAGGGG TTCATTATCA GGAGGGAGGT AGCTACACCC TCACTTACTC CTCAATTCTA AGGTTCCTTC ACGGTACCGA GATGGCAACA GTCTCCCAGA CAGGAAGATT CTACGCCTCC CAGACCTTCA ACAACATCTA TCAATTCGCA TATCTGGACG AAACCTTCAA GGAAATTGCA AATGAGACCG GGTTCTACTC CTCCTCCATG TATCTGGCGG GCAACTACCC CGTGACACTG CAGATTTCAG CGTTTGCCAC TCCAATAACC TCACCTAACG TGATACCCTT CAATCTGTCA TATGCGCAGA ACGGATCCAT TCAACTGGGT GCGAATTACC TGTACTCATT TAGCCTTAAC GGTTACGTCA CGAGGCAATC CCTTCAAGAG AACTTGACCG CTCAGGGAGG TTTCTCTGGG ATAATTGAGG TGATCAATAG TTATGGCGGA GCCGTTCTCG TTAAGCTCAC GTCCAATAAC GCCCTAACTC AAAAGTACCT CACCTTCATC TATCAGGAAC CGGGTGTAAC CGAGTTCAGG GAAAACTTCT TCGCCATGGC CGGACAGAAC AGCTCCGTGA ATGCTACGGG CTACTACCTG AAAATACAGA GGAGTTTTAC ACCACTAACT GACCCTGCCT ACCAGGAAGT TGTAAGCTTC ACTGAAGATC ATCTTACTGT AGATCATCTG 
GTGGCATATC ATCGAAGTAT TCTGGCGGAA CTTCCTCGCT TTGCTCTTCT TCCACATCCC TCCCCTTTTT AG
|
Protein sequence | MKSWILLIVL VLGLLPGVGA FSSPTMPHNF TLYNITAMQG LDPKYYSFEA VGYLPPNETP VTVTVAKNLI FNDTAWKPYY LNVHIPNGSY SFILMNVSVK EENGTQFDRP FYIFVNGIPV FWGSTQEIQN SSASVDLTMF ENLLHGNVTF EPVLVNFYDA KVNITGIYLV NITLSLYPGQ APSNLPNEFI PLFVNGTFNY NYSYVILNPN QDTITSSVKL PNGTYRMSAF LYEEGGGLDE FWYSNEPATR DILLYYDGLL GGVVPPYETI YTGGIDLFWW KPLSSINTLA FHTPYQVDLT PLLALGSNAN VTVTVSNLGT AKELTGSSSF DWDLSGFLAL WVNQSNPLIS GQVVKAYTRF IDSSPIFVGG FSGVHYQEGG SYTLTYSSIL RFLHGTEMAT VSQTGRFYAS QTFNNIYQFA YLDETFKEIA NETGFYSSSM YLAGNYPVTL QISAFATPIT SPNVIPFNLS YAQNGSIQLG ANYLYSFSLN GYVTRQSLQE NLTAQGGFSG IIEVINSYGG AVLVKLTSNN ALTQKYLTFI YQEPGVTEFR ENFFAMAGQN SSVNATGYYL KIQRSFTPLT DPAYQEVVSF TEDHLTVDHL VAYHRSILAE LPRFALLPHP SPF
|
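The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names are hypothetical, and the example input is only the first three blocks of the Gene sequence field, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace (spaces, newlines) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks copied from the Gene sequence field.
first_blocks = "ATGAAGTCCT GGATTCTCCT GATTGTTCTG"
seq = clean_sequence(first_blocks)
print(len(seq))  # 30
print(round(gc_fraction(seq), 2))
```

Applied to the complete Gene sequence field, the cleaned length would be expected to match the Gene Length field if the record is internally consistent, and the GC fraction can be compared with the GC content field.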