Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2101 |
Symbol | |
ID | 5104395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2022219 |
End bp | 2023367 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507991 |
Product | cobalamin biosynthesis protein CbiD |
Protein accession | YP_001192165 |
Protein GI | 146304849 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1903] Cobalamin biosynthesis protein CbiD |
TIGRFAM ID | [TIGR00312] cobalamin biosynthesis protein CbiD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.753139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTGTAA GAATACCGCT AAACTATCAG CCAAGCTCAC CCTCACTGTC CAACGCTAAA TCGGCCAAGC GAAAAATCCT CAAGATGGGA TTTACGACTG GAACCGCTCT TGCGGCCGCG GCTAAGGCTT GTGCCTATGC CATAACTGGA GAGATTAAGA AGGCGGTAGT GGTGTCAACC CCCATTGGAC TCAGAATAGA GATTCCGCTA AATTACGTAA AAATGGAGGA AGAGTGGTGC GTAGCCTCAG TAACTAAGTA CGGTGGCGAT GATCCGGATG ACACAAACGG AATGGAAATA GTGGTAAGAA TGAGATTGAA TGACGACAAG GGATACATAG TGGTGAGGAC AGGTAAGGGG CTTGGAAAGG CGGTCAGTAA GGGGTTACCT GTGAACCCTG GAGAACCTGC CATAAACCCA GTCCCTAGGA AACAACTAAT CGACAATCTA AAGGAGGTTC TCGGGGATAA TTTTGGGGCA GAAATAGAGG TGATAATTCC AGAAGGAGAG AGAATCGCAA GGAGGACCTT CAATCCTAAG CTTGGGATAG AGGGAGGAGT CTCCATCCTA GGAACCAGTG GGGTGGTGAA GCCTATGAGC CTCGTGTCCT GGTACGCTTC CCTAGTGGAG CAGTTAGATA TTGTAAAAAC CTATGGGATA GATCAGGTTG TACTGGTACC AGGGAATATA GGTGAGACCA GCGCCAGGAG AAAGCTCAAC GTTGATTCCA GGTCAATAGT TCAAATGGCC ATATTCACGG GAGGTATGTT GAAGGCTTCG GCGAAGAGAG GATTCAAGGA AATTCTGCTC TATGGGCATG TGGGGAAATT AATTAAGAGC GCCATAGGCA TATGGAACTC CCACTATAAG TATGGGGATG GTAGATTAGA GATAATAACT GCCTACGCAG CCAAGCACGG GGTTAAACAC GACGATATCC TCAGGCTAAT CAACGCTAAA ACTACGGATG AAGCTATTGC CATTTTGAAA GACTATGACT ACAAGGAGAT ATTTAATGAA ATAGCGGATA AGATATCCCT TAACTCCCAT AACCTTATTG AAGGAAAGGC AAAGGTATAC TGCATTCTAA TAAACATGGA AGGGGAGATA GTTGGATTAT CAACCGGGTC AGAGAAATTC TTGGCCTAA
|
Protein sequence | MSVRIPLNYQ PSSPSLSNAK SAKRKILKMG FTTGTALAAA AKACAYAITG EIKKAVVVST PIGLRIEIPL NYVKMEEEWC VASVTKYGGD DPDDTNGMEI VVRMRLNDDK GYIVVRTGKG LGKAVSKGLP VNPGEPAINP VPRKQLIDNL KEVLGDNFGA EIEVIIPEGE RIARRTFNPK LGIEGGVSIL GTSGVVKPMS LVSWYASLVE QLDIVKTYGI DQVVLVPGNI GETSARRKLN VDSRSIVQMA IFTGGMLKAS AKRGFKEILL YGHVGKLIKS AIGIWNSHYK YGDGRLEIIT AYAAKHGVKH DDILRLINAK TTDEAIAILK DYDYKEIFNE IADKISLNSH NLIEGKAKVY CILINMEGEI VGLSTGSEKF LA
|
| |