Gene Athe_1558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1558 
Symbol 
ID7409066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1649224 
End bp1650348 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content35% 
IMG OID643715930 
Productglycosyl transferase group 1 
Protein accessionYP_002573429 
Protein GI222529547 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.899676 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTG GCATTGATGG TAGAGCTGCT AAATGGTATA GGGGTTCCGG TATTGGCACA 
TACACCTATC AGCTTTTAAA TTATATCAAA AAACTCGACA AAGAAAATGA ATATTTAATA
ATCTGGCCTG ATAGCTGCGA AACAGAATTT GTACTTGCAC AGAATATAAA CATCAACCTT
CTTCCTCAGC AGTTAGATAA GTTCTGGGAA GAAATTATGA TAAAAGAGAT TATACTTCAA
AATGATATTG ATATATATCA TGTCCCTCAA AATGGTATTG GGCTTCCTCT GTCTAAAAAG
TGCAGTTACA TTATTACTCT GCATGACATA ATTCCTTTCA GGCTTCCTGA AACAGTTGGA
CCGGGCTATT TGAGAATATT TAGAGATGTC GTCCCAAAAA TTATAAAAAT AACCGACTGC
ATCATCACTG TATCTGAGTT TTCCAAAAAA GATATTTGCG AATACTTCGA CATCCATCCA
TCAAAAGTAT TTGTAACATA TTTAGCAGCA GAAGACATAT ATAAGCCCTT ACCAGAAGAT
GAAGTCAAAA CCTTTCTTTT GCAAAAATTT AACATTGACT TCCCATACAT CCTTTATGTA
GGTGGTTTTT CGCCGAGGAA AAATCTTAAA AGATTGGTCA AAGCTTATTC CTTAATAAGA
GAAAAAATCA AGCACATCCA TCTTGTTATA CCGGGGAAGT TCAGCAGAAG TTATGAGGAA
ATAAAAAATC TGGTTGAAAA TCTTAAGCTC ACTTCCCACG TACATTTTCT TTCTTATGTG
GATGTAGAAT TTATGCCATA TATATACAAC GGTGCTTTGC TTTTTGTATA TCCATCGCTG
TACGAAGGGT TTGGTCTTCC ACCACTTGAG GCAATGGCTT GCAAAGTCCC GACTATAGTA
AGCAACACAA CTTGCCTGCC TGAAGTTTTA AAGGATGCTG CCCTGTATGT TGACCCTTAT
TCCGAAGAAG ACATTGCTCA AAAAATACTG CTTGTATTAG AAAATGTAGA GCTGCGAAAT
CAGCTTGCTC AAAAGGGTTA TTTTCTTTCA AAAAGTTATA GCTGGGAAAA AACAGTTATG
AATACTTTGA AAATTTACCA GCAAATTTTT TCCTTGAAAT ATTAA
 
Protein sequence
MKIGIDGRAA KWYRGSGIGT YTYQLLNYIK KLDKENEYLI IWPDSCETEF VLAQNININL 
LPQQLDKFWE EIMIKEIILQ NDIDIYHVPQ NGIGLPLSKK CSYIITLHDI IPFRLPETVG
PGYLRIFRDV VPKIIKITDC IITVSEFSKK DICEYFDIHP SKVFVTYLAA EDIYKPLPED
EVKTFLLQKF NIDFPYILYV GGFSPRKNLK RLVKAYSLIR EKIKHIHLVI PGKFSRSYEE
IKNLVENLKL TSHVHFLSYV DVEFMPYIYN GALLFVYPSL YEGFGLPPLE AMACKVPTIV
SNTTCLPEVL KDAALYVDPY SEEDIAQKIL LVLENVELRN QLAQKGYFLS KSYSWEKTVM
NTLKIYQQIF SLKY