Gene Msed_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2101 
Symbol 
ID5104395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2022219 
End bp2023367 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content46% 
IMG OID640507991 
Productcobalamin biosynthesis protein CbiD 
Protein accessionYP_001192165 
Protein GI146304849 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1903] Cobalamin biosynthesis protein CbiD 
TIGRFAM ID[TIGR00312] cobalamin biosynthesis protein CbiD 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.753139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTGTAA GAATACCGCT AAACTATCAG CCAAGCTCAC CCTCACTGTC CAACGCTAAA 
TCGGCCAAGC GAAAAATCCT CAAGATGGGA TTTACGACTG GAACCGCTCT TGCGGCCGCG
GCTAAGGCTT GTGCCTATGC CATAACTGGA GAGATTAAGA AGGCGGTAGT GGTGTCAACC
CCCATTGGAC TCAGAATAGA GATTCCGCTA AATTACGTAA AAATGGAGGA AGAGTGGTGC
GTAGCCTCAG TAACTAAGTA CGGTGGCGAT GATCCGGATG ACACAAACGG AATGGAAATA
GTGGTAAGAA TGAGATTGAA TGACGACAAG GGATACATAG TGGTGAGGAC AGGTAAGGGG
CTTGGAAAGG CGGTCAGTAA GGGGTTACCT GTGAACCCTG GAGAACCTGC CATAAACCCA
GTCCCTAGGA AACAACTAAT CGACAATCTA AAGGAGGTTC TCGGGGATAA TTTTGGGGCA
GAAATAGAGG TGATAATTCC AGAAGGAGAG AGAATCGCAA GGAGGACCTT CAATCCTAAG
CTTGGGATAG AGGGAGGAGT CTCCATCCTA GGAACCAGTG GGGTGGTGAA GCCTATGAGC
CTCGTGTCCT GGTACGCTTC CCTAGTGGAG CAGTTAGATA TTGTAAAAAC CTATGGGATA
GATCAGGTTG TACTGGTACC AGGGAATATA GGTGAGACCA GCGCCAGGAG AAAGCTCAAC
GTTGATTCCA GGTCAATAGT TCAAATGGCC ATATTCACGG GAGGTATGTT GAAGGCTTCG
GCGAAGAGAG GATTCAAGGA AATTCTGCTC TATGGGCATG TGGGGAAATT AATTAAGAGC
GCCATAGGCA TATGGAACTC CCACTATAAG TATGGGGATG GTAGATTAGA GATAATAACT
GCCTACGCAG CCAAGCACGG GGTTAAACAC GACGATATCC TCAGGCTAAT CAACGCTAAA
ACTACGGATG AAGCTATTGC CATTTTGAAA GACTATGACT ACAAGGAGAT ATTTAATGAA
ATAGCGGATA AGATATCCCT TAACTCCCAT AACCTTATTG AAGGAAAGGC AAAGGTATAC
TGCATTCTAA TAAACATGGA AGGGGAGATA GTTGGATTAT CAACCGGGTC AGAGAAATTC
TTGGCCTAA
 
Protein sequence
MSVRIPLNYQ PSSPSLSNAK SAKRKILKMG FTTGTALAAA AKACAYAITG EIKKAVVVST 
PIGLRIEIPL NYVKMEEEWC VASVTKYGGD DPDDTNGMEI VVRMRLNDDK GYIVVRTGKG
LGKAVSKGLP VNPGEPAINP VPRKQLIDNL KEVLGDNFGA EIEVIIPEGE RIARRTFNPK
LGIEGGVSIL GTSGVVKPMS LVSWYASLVE QLDIVKTYGI DQVVLVPGNI GETSARRKLN
VDSRSIVQMA IFTGGMLKAS AKRGFKEILL YGHVGKLIKS AIGIWNSHYK YGDGRLEIIT
AYAAKHGVKH DDILRLINAK TTDEAIAILK DYDYKEIFNE IADKISLNSH NLIEGKAKVY
CILINMEGEI VGLSTGSEKF LA