Gene Athe_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2066 
Symbol 
ID7408775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2183550 
End bp2184677 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content31% 
IMG OID643716433 
Productglycosyl transferase group 1 
Protein accessionYP_002573916 
Protein GI222530034 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.033296 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTT TGCACCTTAT AAGCGGTGGT GATACAGGGG GAGCAAAGAC ACATATAATA 
AACCTGTGTT CAAAACTAAA AGATCTTGTC AGTCTTAAAA TTATATGTTT CATGTACGGG
CAATTCTATG AAGAGGTAAA AAATGCTGGA ATTGATATAG ATGTTATTCA ACAATCCTCT
CGATTTGACT TGAGTGTGGC TGACAGAATT GCGCATATCG TTAAAGCTGA AGATTATGAT
ATAATTCACT GTCATGGTGC AAGAGCAAAT TTTATTGGAA TGTTTTTAAA ACGAAAAATC
AAAAACAAGC CATTTATCAC AACAGTACAT AGCGACTTTG ATTTAGATTT TCAGGACGTT
TTTTATAAAA GAGTAGTGTT TTCATTTCTT AATAAACTTT CTTTAAAAAG ATTTGACTAT
TTTATTTCTG TGGGATCTGC ATTGATTGAC AAAATAAAAG GACTTGGGGT GAAAGAAAAT
AGAATTTTTC TTTTGTACAA TGGTTTTGAC TTTTCAAAAG AGATACATTA TGTGCAAAAG
GATGAATTTT TATCAAAGTT TTTTGATAGA AAAGTATTTG ACTCCAAAAT AGTTATAGGA
AACTTGAGCA GGTTATACAA GGTAAAAGGT TTAGATGTAT TTATAAAAGC CGCCAATATA
ATAGCTAAAA AATATCCTGA GGTCATTTTT TTAATCGGCG GAAGTGGTCC TCAAAAGGAA
TTTTTAAAGC AAATGATAAG TGAATACAAT TTAAATGACA GGGTATTTCT ACTTGGCAGT
ATAAAAAATC CATATGACTT TTTTAATAGC ATAGATATAA ATGTCATAAG TTCATACTCT
GAAACTTTCC CATATTCAAT CTTAGAAGCA ACAGCACTTG AAAAGTGTTG TATATCAAGC
AAAGTGGGTT CAGTGCCAGA CTTGATTGAA GATGGTAAAA ATGGTTTTTT ATTTGACGCT
GGAGATTATA AAGGGCTTGC TCAAAAGATA GAAATTCTTT TGCAAAATAA AGACCTTATC
AAAGAATTTG GACAGCTTCT TTCTAAAAAA GCAAAAGAAA AGTTTTCTGC AGAAAATATG
GCAAGGATGC AATTTGAGAT TTATAAAAGC ATACTTTCGA AAAAATAA
 
Protein sequence
MKVLHLISGG DTGGAKTHII NLCSKLKDLV SLKIICFMYG QFYEEVKNAG IDIDVIQQSS 
RFDLSVADRI AHIVKAEDYD IIHCHGARAN FIGMFLKRKI KNKPFITTVH SDFDLDFQDV
FYKRVVFSFL NKLSLKRFDY FISVGSALID KIKGLGVKEN RIFLLYNGFD FSKEIHYVQK
DEFLSKFFDR KVFDSKIVIG NLSRLYKVKG LDVFIKAANI IAKKYPEVIF LIGGSGPQKE
FLKQMISEYN LNDRVFLLGS IKNPYDFFNS IDINVISSYS ETFPYSILEA TALEKCCISS
KVGSVPDLIE DGKNGFLFDA GDYKGLAQKI EILLQNKDLI KEFGQLLSKK AKEKFSAENM
ARMQFEIYKS ILSKK