Gene Msed_1590 details


Gene Information

Locus tag: Msed_1590
Symbol:
ID: 5103954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1536899
End bp: 1538305
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 44%
IMG OID: 640507477
Product: glycosyl transferase, group 1
Protein accession: YP_001191669
Protein GI: 146304353
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
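
The length fields in this record are consistent with the coordinates: with 1-based inclusive positions, 1538305 - 1536899 + 1 = 1407 bp, and 1407 / 3 = 469 codons, one of which is the stop codon, leaving 468 amino acids. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(1536899, 1538305)
print(gene_len)                  # 1407
print(protein_length(gene_len))  # 468
```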


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00816719
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0063014
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTTATCGC CAGTAACGAC CAGGGACATT CCTCTCTATT CTCGTGGGTT CTTCATCTTT 
ATAACAGGTA GTTACGTTAC TTACTGCATG AGAATTGCCA TTAATACTCA AACTCCTCCA
ATCCGTTTCA ACTTCACCTA CAAGGAACTC TTGGAGAGAT ATGGAGAAGT TCCAATCCCA
CTGGATCTGA GCTCACTTTC TGACCAAGAT TATCAGATCT CAGTGGGCGG AGTAGCTAGG
ATGATGATGT CCCTAGTCAA AACCTCAGGA TTTGAGAGGA GCAGATGGGT AGCCTTAGGA
CCTGGATACC CTCCTCAGGT TGTGCTGGAA GGAATAGATA TTCACTTCAT CGATTTACCT
CCAACTCTTC TATCCAGTTA CACCAGATTC AAGGAAGGAT TATACAATGA AGCTCACGGC
ATGTACAACT ATAATATCGT TGGCTCAGAC TACATAGCTT ATGCCTCCTA CAACTGGCAT
TCTGCCCTTA AACTCCTGGA ATTCTATTCT GATACAGACG TATATCTGAT AAATGATTTC
CAGCAGTTAC TCGTGGGAGG GATAATTGGC CCATCAGCTC CCGCAGTCCT ATGGTACCAT
ATTCCCTTTG TGCCAGAGAG GTTAAGTGAT AGGGTCAGAG AGTTCCTAGT TAGGGCATTT
GAGGGTTTCG ATCTGATCAT ATCCAGTACA AAACGTGACC TAGAGGGCTT AGTCAGATCA
GGTGCCAAGG TGAAAGCTAG ACAAATATAT CCCTTCATAG ATCCCTCAGC TTACTCCCAC
GTAAGCAAGA GCGAAGTCGA ATCTGTTTCA GAGAAATTTG GGATCAAAGA GGATGATAAG
GTAGTCCTGT TAGTGGGAAG AATGGACCCG ATAAAGAGTC AGGACATAGC GATTAAGGCT
ATCAAAAACA CCAACTTGAA GCTGGTAGTG GCAGGGAACG GAAGTTTCAC CAGTAAGAGC
CTTGGCCATG ATAAGGCTAG TATTTGGGCA AGAAAGCTAA GGGACTTAGT GGTGGAGCTA
GGAGTTCAGG ATAAGGTCAT TTTCACAGGA TATGTGAGCG ACCATGAACT AATGGCTTTA
TATCAAAGAA GCGACGTGGT CACTTTGACC TCAAGAAGCG AGGGGTTTGG GTTAACAATA
TGTGAAGCGT GGAATTACAA GAAGCCAGTG GTAGTAAGCG AGGGAGCCGG CGTAAGTGAG
TTGATCATAA ATGGGGTGAA TGGATACACT TTTTCGCCTG ATAAACCAGA TCAAATGTCA
TGGGCTTTGG TGGAAGCGGT AAAGAATTAC GACAAGTTAG GTCAAAGAGG TTATGAGACA
TTGCCACAAT GTTCCGTACA AACAGCGTCA GGAAAAGTAA AGGAAGCATT AGAGGAGGCT
CAAAGAGGTT ACATTCATAG CAAGTAA
 
Protein sequence
MLSPVTTRDI PLYSRGFFIF ITGSYVTYCM RIAINTQTPP IRFNFTYKEL LERYGEVPIP 
LDLSSLSDQD YQISVGGVAR MMMSLVKTSG FERSRWVALG PGYPPQVVLE GIDIHFIDLP
PTLLSSYTRF KEGLYNEAHG MYNYNIVGSD YIAYASYNWH SALKLLEFYS DTDVYLINDF
QQLLVGGIIG PSAPAVLWYH IPFVPERLSD RVREFLVRAF EGFDLIISST KRDLEGLVRS
GAKVKARQIY PFIDPSAYSH VSKSEVESVS EKFGIKEDDK VVLLVGRMDP IKSQDIAIKA
IKNTNLKLVV AGNGSFTSKS LGHDKASIWA RKLRDLVVEL GVQDKVIFTG YVSDHELMAL
YQRSDVVTLT SRSEGFGLTI CEAWNYKKPV VVSEGAGVSE LIINGVNGYT FSPDKPDQMS
WALVEAVKNY DKLGQRGYET LPQCSVQTAS GKVKEALEEA QRGYIHSK
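
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to clean a block, compute GC content, and translate it. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are allowed), and all function names are illustrative rather than taken from the source database.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGTTATCGC CAGTAACGAC CAGGGACATT")
print(translate(first_blocks))  # MLSPVTTRDI
```

Passing the full cleaned gene sequence to the same functions would allow the reported GC content and protein sequence to be checked against the record.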