Gene Msed_0790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0790 
Symbol 
ID5105113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp723153 
End bp724181 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content49% 
IMG OID640506695 
Productglycosyl transferase family protein 
Protein accessionYP_001190889 
Protein GI146303573 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.68906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTACTCC AGTTCCTCCT GGTGTTGCCT GCACTGGCAG ATCTAGTCCT CCTTTTACAG 
ATCTGGAGGG AAAACTCAAT CTTCAAATTT GACGGTAAGT TCTGCGCTCC TGCATCCATC
ATTGTTCCCG TGAGGGGTCT CGACCCCGAA CTAGAGAGGA ACGTGGAATC GCTCAGGAAC
CAGGACTTTC CCTGTCCCTT CGAGATAATA TACGTGGTGG ATCCTGATCA ACCATGGTTG
GCTGAACGTC TGAGACGGCT TGGAGTGAAG GTCGTGATCA CCAGTTTTAC CTGTTCATGT
AGCGGTAAAA TAAGGGCACA ACTCTCAGGG CTAAGGGAGT CCGCGAATGA AGTGGTGGTC
TTCGCCGACT CCGACACGCT CTATCCTAGG AACTGGTTGA GGGAGATGGT GGGGAACCTT
GACAGGCACA TGGCTGTAAC CACGTTTTCA TGGCCCGCCC CCCTCAAAAT AACGTGGAGA
AACCTGATCA GGGCTGGCTT CTGGACATTG GGATTCGAGT CTCAGGCCTC TGGTGGGACC
TTCCTCTGGG GAGGCTCCAT GGCCTTCAGA AGAGATTTCT TTGATAGTGA GGTCCTGGAA
GAGCTTTCGC GTGAATGGTG TGACGACTGC ACCCTCACTA GGATAGTGAA AAAGCGAGGA
GTAAGTATCG CCTTCGACGG TAAAGCCATC CCACTCAACA TTTATGACGA GAGAGACCTA
TGGAAATGGT CCACAAGGCA GGTCGTCACG ATCATCAAGT ACTCTAGCAG AGGAGCCAAG
GCCTTCCTGG TGATAGGTGC TCTCATGCTT GCCTTTCCAA TCCTTTTCCT TGTCTTCTTG
AACCCATTCT ACCTGTCTCC TCTGCTTCTA TGGATTCTGA AAAATTTCTC CAGAAGTAGA
AATCTGGGGA AATATTCATA TACCCCATCT GTCATGTCAA TTTTAGGTGT ATATTACGGG
TGGATCAAGC TAATCCTTGA CTACAGGAAA AGGACAGTCG TTTGGAGAGA CAGGGTCTAT
AATCTTTAA
 
Protein sequence
MLLQFLLVLP ALADLVLLLQ IWRENSIFKF DGKFCAPASI IVPVRGLDPE LERNVESLRN 
QDFPCPFEII YVVDPDQPWL AERLRRLGVK VVITSFTCSC SGKIRAQLSG LRESANEVVV
FADSDTLYPR NWLREMVGNL DRHMAVTTFS WPAPLKITWR NLIRAGFWTL GFESQASGGT
FLWGGSMAFR RDFFDSEVLE ELSREWCDDC TLTRIVKKRG VSIAFDGKAI PLNIYDERDL
WKWSTRQVVT IIKYSSRGAK AFLVIGALML AFPILFLVFL NPFYLSPLLL WILKNFSRSR
NLGKYSYTPS VMSILGVYYG WIKLILDYRK RTVVWRDRVY NL