Gene Msed_1584 details


Gene Information

Locus tag: Msed_1584
Symbol:
ID: 5104029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1531867
End bp: 1532817
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 47%
IMG OID: 640507470
Product: glycosyltransferase family 28 protein
Protein accession: YP_001191663
Protein GI: 146304347
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID:
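
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are inclusive (the record does not say so explicitly, but the arithmetic matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1531867
end_bp = 1532817

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 951 bp
print(protein_length, "aa")  # 316 aa
```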


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0306453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00652045
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAGAGGC TGCTCATTAT AGCGAGCGGA GGGGGACATA CAGGATTTGC CAAGGCCATC 
GCAGAGTATT TACCCTTTAA GGTTGACTTC GTGATTCCTG AGGGCGACGA AAACTCCAGG
AAGCTCTTGA GCCCTCACGC TGAGAAGATA TACGAGGTTA GTAAGCCTAG GGATCCGAAG
GGCCCAAACA CTTCCTTGGT AACAAGGGGG TTTAGGGCGC TCTCGCAATC AATTTCTCTT
CCAAGTTATG ACGTGGTCAT TGCTACTGGC TCGAATCATT CAGTGATACC TTCTTTATTT
CAAAGATTGA GGGGTTCCAC TCTCTTCACG CTAGAAAGCC AGGATAGGAT TGTGACCAAG
GGAAAGGCAG TGTCCGTGTT ATCGCACTTC TCCAAGGGAG TGTTCCTTCA CTGGAAGGAG
CAATCTAGGC TTTATAAGAA CGGGATAGTG GTGGGGCCCA TTCTTCAAAG GAGAAAATAT
GATCCAGTGG ACGAGGGTTT CATCCTAGTC ACCGCGGGGA CCGAAGGTTT CAAAGCCCTT
TTTGACAGGA TTTCCAGTCT AGGGCTAACA AATCTGGTAA TGCAGACTGG AAAGATATCT
CCTGAGCCAT ACGTGAAAAG TGGCGTAAAA GCATTCAGCT TTGATCCAGA TATTGAGAGA
TTTATCGCAG GAGCTTCCCT GGTAATAACT CATCAGGGTA AAACAGCCAT GGAGTCAGCC
GTGCTCTATG GGAAACCCAC AATCATTGTC TTTAACAAGA GCCTCACCAG GGCTGCTACC
CACGAGGATG TGAAACTATA CTCCGAAATA ATAGGTGCAG AATTTCTAGA CGATCCATCC
ACTTGGGAAG ACGAGGAATT GCTTAAGGCC ATTCAGAAGC GTAAAAAGCC CAACTATTAT
GAACCTGGGA CAGAGAGGTT AGTGAAGGTG ATCATGGATT ATCTTGAATG A
 
Protein sequence
MKRLLIIASG GGHTGFAKAI AEYLPFKVDF VIPEGDENSR KLLSPHAEKI YEVSKPRDPK 
GPNTSLVTRG FRALSQSISL PSYDVVIATG SNHSVIPSLF QRLRGSTLFT LESQDRIVTK
GKAVSVLSHF SKGVFLHWKE QSRLYKNGIV VGPILQRRKY DPVDEGFILV TAGTEGFKAL
FDRISSLGLT NLVMQTGKIS PEPYVKSGVK AFSFDPDIER FIAGASLVIT HQGKTAMESA
VLYGKPTIIV FNKSLTRAAT HEDVKLYSEI IGAEFLDDPS TWEDEELLKA IQKRKKPNYY
EPGTERLVKV IMDYLE
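
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard genetic code, which translation table 11 shares for all 64 codons (the two tables differ only in which codons may act as start codons).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove display spaces and line breaks, and uppercase the result."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, first three blocks only.
first_blocks = clean_sequence("ATGAAGAGGC TGCTCATTAT AGCGAGCGGA")
print(translate(first_blocks))  # MKRLLIIASG
```

Passing the full gene sequence through `clean_sequence` and then `translate` should reproduce the protein sequence listed above, and `gc_content` on the cleaned gene sequence can be compared with the GC content field.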