Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_0333 |
Symbol | |
ID | 3997622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 313510 |
End bp | 314436 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637958158 |
Product | cell surface glycoprotein (s-layer protein) |
Protein accession | YP_565079 |
Protein GI | 91772387 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3420] Nitrous oxidase accessory protein |
TIGRFAM ID | [TIGR03024] PEF-C-terminal archaeal protein sorting domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0000130039 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTA ATTATAAAAC CAATGGAATT ATAAAAATAA TTCTATTGCC ATTATTCATG GCTATTCTAA TGACACAGGC GAGTGCATCT ATCCTTGAGG TGGGGGAAGG GCAAGAGTAT TCACATATCC AGGATGCTGT GAACAATGCA AATGAGGGTG ACAAGGTCAT TGTTCACAGT GGTATTTATG AAGAAAATGT TATTCTGGAC AAACAAATAA TATTGCAGGG AGTGGGTAGT CCTATCATAG ATGGAATGGG AGTGGGTAAT TCCCTTAGCC TTTATGCAGA AAGCTTAGTT GTAGATGGTT TTGTTCTTTG TAATGGGAGA AGTGGTTCAT ATGTTGTATC AGACAACAAT ATTCTAACGA ACAATACTTT TAAAGGCAAT CAATATGGGG TATATCTGTT TGGATCAAAA GGAAATGTTA TTGAACAAAA CGTTATCGAG CACAACCAAA GATATGGGGT GTATTTGCTT TTTAAGAGTG ATAATAATAT CATAACCGAT AATATGATCA ACAATAATGG TGGCGGGATA CGAATTATTT CCTCTGATGA TAATAAGTTG TATCTGAACA GCATCATTGA GAATGTCGTG ATCTCCAATG GAAATAATCA ATGGGATGAC GGTGTGGATA AAGGAAATCA TTATAGTTTC TTCGATGAAG AAAGTGAAGG TTTCATTGAT AAAGATCATG ATGATGTATC AGATGTTCCT TACAAGATAC CTGTAAAAAA TGAAGTTGAC AACTATCCTC TTGCAAGCAT AGGATCAACA CGACCAACAA TAGTTTTAGT AAAAGAGAAC CCTATAGAGC CTTCAAAGGA AACTTCTGAA GAGATCCCTG AATTTCCAAC AGTAGCATTT CCCATATTGC TTTTAATGGG AATATTTGTT GTGTTCAATA AGAAAACGAA TTCATGA
|
Protein sequence | MSINYKTNGI IKIILLPLFM AILMTQASAS ILEVGEGQEY SHIQDAVNNA NEGDKVIVHS GIYEENVILD KQIILQGVGS PIIDGMGVGN SLSLYAESLV VDGFVLCNGR SGSYVVSDNN ILTNNTFKGN QYGVYLFGSK GNVIEQNVIE HNQRYGVYLL FKSDNNIITD NMINNNGGGI RIISSDDNKL YLNSIIENVV ISNGNNQWDD GVDKGNHYSF FDEESEGFID KDHDDVSDVP YKIPVKNEVD NYPLASIGST RPTIVLVKEN PIEPSKETSE EIPEFPTVAF PILLLMGIFV VFNKKTNS
|
| |