Gene Msed_0129 details


Gene Information

Locus tag: Msed_0129
Symbol:
ID: 5104982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 105029
End bp: 106222
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 44%
IMG OID: 640506030
Product: glycosyl transferase, group 1
Protein accession: YP_001190230
Protein GI: 146302914
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
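
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length. Neither convention is stated on this page.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon excluded from the protein.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
print(cds_lengths(105029, 106222))  # (1194, 397)
```

Under these assumptions the result matches the listed Gene Length (1194 bp) and Protein Length (397 aa).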


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.730138
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000955625
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATAGAAA AATACGCTGA AATAATAGGA GAAAACGAGC TAGATGCAAT CTACAGAATA 
GCTGAGAAGT TGAAGGGATT GTCAGTTCTT CATGTTAATT CAACACCTAA GGGCGGTGGA
GTAGCTGAAA TCTTGAGCAA ACTTGTCCCT ATGATGAACG AACTGGGGAT AAACACTCAA
TGGAAGGTCA TAAGGGGGGA CAATGAGTTC TTTAACGTAA CAAAGTCATT TCATAACTCC
CTACAGAACG GTTCGGGAAA TGTAAATGAT GAGACTTTCG CAGTTTACGA GAAATGGCAA
GATATTAATG CATCGGAAAT TCCGCTGGAC TACGATGTGG TCTTCATACA CGATCCTCAG
CCTGCAGGAT TAATCAAGGT AAGAAAGAAG GGAAAGTGGA TATGGAGATG TCATATTGAT
ATTTCCAATC CATATCCAAA GGTTTGGGAA TTCCTTCGTA AGTATGTTCA GCAGTATGAC
GCCATGATAA TATCGTCATC GGTATTTGGA AGAGAGGACC TTACCATCCC ACAGTACGTG
ATTCCGCCGT CGATAGATCC CCTAAGCGTT AAGAACAGAG AGATATCACG ATTTACTGTC
GAGAGGATCC TAAGGAAGTT TGAGGTTGAT CCTGATAGGC CACTGGTGAC TCAGATTAGC
AGATTTGACC GGGCAAAGGA TCCAATAGGA GTAATATCTG CCTTTAAGGG GCTAAAGAAG
CACATAGATG CTCAATTGGT CTACTTGGGA AGCCCCGCAT CTGACGACCC CGAAGGGGAT
CAGGTATATC AGGAGACGGT AAAGGCAGCT GAAGGAGTGA AAGATGTTCA TCTCTTAATG
CTACCCCCCG ACAGTGATCT GGAGGTTAAT GCGTTCCAAA GGGGAGCAGA CGTCGTAATG
CAGAAATCGA TTAAGGAAGG CTTCGGTTTA ACCGTAAGCG AGGCCATGTG GAAAAGGAGA
GCAGTAATTG GTGGTAATAC GGGAGGAATT CCCTTACAGA TAATCCATGG TTATACGGGA
TTCCTAGTGA ATACGCCCGA GGAGGCCACA CATTATCTCA TTTACTTACT GCGAAATAGA
CAAATAAGGG AGAAGATAGG TCAGGACGCT AGGGAGCATG TTAGAAATAA CTTCCTGATG
ACTAGGGAAC TTCGGGATTA CCTAATGGTA ATGTTATTGT CAATTCAAGG CTGA
 
Protein sequence
MIEKYAEIIG ENELDAIYRI AEKLKGLSVL HVNSTPKGGG VAEILSKLVP MMNELGINTQ 
WKVIRGDNEF FNVTKSFHNS LQNGSGNVND ETFAVYEKWQ DINASEIPLD YDVVFIHDPQ
PAGLIKVRKK GKWIWRCHID ISNPYPKVWE FLRKYVQQYD AMIISSSVFG REDLTIPQYV
IPPSIDPLSV KNREISRFTV ERILRKFEVD PDRPLVTQIS RFDRAKDPIG VISAFKGLKK
HIDAQLVYLG SPASDDPEGD QVYQETVKAA EGVKDVHLLM LPPDSDLEVN AFQRGADVVM
QKSIKEGFGL TVSEAMWKRR AVIGGNTGGI PLQIIHGYTG FLVNTPEEAT HYLIYLLRNR
QIREKIGQDA REHVRNNFLM TRELRDYLMV MLLSIQG
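
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming the blocks are pasted in as a plain string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and whether the page's GC content value was computed in exactly this way is an assumption.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above.
fragment = "ATGATAGAAA AATACGCTGA"
print(clean_sequence(fragment))  # ATGATAGAAAAATACGCTGA
print(gc_fraction(fragment))     # 0.3
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.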