Gene Msed_1808 details

Gene Information

Locus tag: Msed_1808
Symbol:
ID: 5105371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1750058
End bp: 1751308
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 40%
IMG OID: 640507707
Product: glycosyl transferase, group 1
Protein accession: YP_001191886
Protein GI: 146304570
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
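
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1750058, 1751308)
print(length, protein_length(length))  # prints "1251 416"
```

Both values match the Gene Length and Protein Length fields of this record.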


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGTAG GCCTAGTTAC TTTTTCCTTG ATGAACCTAC GGGAAGGCGG AGCAGAGAGG 
CATGTTAGAG AATTTGTTAA TATTGCTAAG ACTAGATTTG AACTTGTTCT CTTTCCCACT
TTAAACACGT ATTTACAAGT TGAAAACGAG GAAGATAAAA ATGCCCTAAT CAAGAGAGCT
CAGGAGCTAG AGAAGGAGGG TATTACACTA GCCTCAGAGT TTTACTCCTT GGTTGACCGC
TCGATTACCA GGAAAGAGCG TCTCTTGAAT TTTGTGGACT TTCGTTTACT GCGCGAGTTA
AGTAAGGCCT ACTATTCCGA TCTAAACAAG ATTGATTTTC TTTTCTCCCC GAACTTTATA
TCTCCAGACG TAGTTCTCAT GGCTAGCCAC TCAGGAAAAG GTTATGGAAT TCTGATAAAC
GGGTATATAG CGCCCCTTCA CATGGATCCC CTCTTATATT CCATTTATAA GTTTAGGATT
GGTCAGGAGA GCTTTCTTAG CTCATTGCCT AAAAACCTTC TAATTTCTAA TTATTGGGCT
AAAGCCATTA AGATAATGAA AAATAATCCA CCTAAGTTCG TGGCAGGCGT AAATAGGGTA
GCTATTGAAG GAGTGATAGG TAAATTGCCA ACTAATCACG TCATTCTGGA TCCTGGATTT
GCAATTGATC CATCTATCGT GAAGTATAGG TCTGAAGCTA AGGATAACTA TGCAATCTTC
GCTGCAGCTA GAGTAGAGCC CAGTAAGGGG ATGCTAGATC TGTTGAGGAT CATGAAGCTC
CTTAAAAGTG CAGACGTGAA ACTAAAGATC ATGGGAAGGT TAAAATCAGA TTCTTCCAAT
TTTTATCGAT TAGCGGAACG TTATGGTGTA AGGGATAAGA TAGAATACCT AGGATTTCTA
TCTGGAGAGG AAAAGTACAA GGTCATGAGC TCTGCAAGGG TCATGGTTTA CCCGTCTCAT
GACGATACCA ATGCACTGGT TGTGATGGAA TCCCTTGCAG TTGGGACTCC TGTCGTTACC
TATGCTATTC CTGGCATTAA GTACGTGTAT GATGGTGTTC CTGGTGTAAC GCTAGTGAAA
GAGTTTAACT ATGACGGAAT GGCGAAAGAG ATCAACAAGA TCATGCATAC TCAATATTCG
TTAAATGATG AGAAGCTACA GAAATTACTC TCCCATCATA GTTCTTGGGC CAAGGTAGTC
GAACAAGAGG TTAACCTGAT ACTTAAATAT GGTATTAAAG ATATGAAATA A
 
Protein sequence
MRVGLVTFSL MNLREGGAER HVREFVNIAK TRFELVLFPT LNTYLQVENE EDKNALIKRA 
QELEKEGITL ASEFYSLVDR SITRKERLLN FVDFRLLREL SKAYYSDLNK IDFLFSPNFI
SPDVVLMASH SGKGYGILIN GYIAPLHMDP LLYSIYKFRI GQESFLSSLP KNLLISNYWA
KAIKIMKNNP PKFVAGVNRV AIEGVIGKLP TNHVILDPGF AIDPSIVKYR SEAKDNYAIF
AAARVEPSKG MLDLLRIMKL LKSADVKLKI MGRLKSDSSN FYRLAERYGV RDKIEYLGFL
SGEEKYKVMS SARVMVYPSH DDTNALVVME SLAVGTPVVT YAIPGIKYVY DGVPGVTLVK
EFNYDGMAKE INKIMHTQYS LNDEKLQKLL SHHSSWAKVV EQEVNLILKY GIKDMK
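
The protein sequence follows from the gene sequence by codon-wise translation. Below is a minimal sketch, assuming the sequence is read in frame from its first base and that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper is hypothetical; a library such as Biopython could be used instead.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino-acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last 10-base groups copied from the gene sequence above.
first_groups = "ATGAGAGTAG GCCTAGTTAC TTTTTCCTTG"
last_groups = "GAACAAGAGG TTAACCTGAT ACTTAAATAT GGTATTAAAG ATATGAAATA A"

print(translate(first_groups))  # prints "MRVGLVTFSL"
print(translate(last_groups))   # prints "EQEVNLILKYGIKDMK*"
```

The two outputs correspond to the start and end of the listed protein sequence, with the trailing `*` marking the TAA stop codon.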