Gene Athe_1864 details


Gene Information

Locus tag: Athe_1864
Symbol:
ID: 7408977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1957841
End bp: 1959541
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 32%
IMG OID: 643716236
Product: glycosyl transferase family 39
Protein accession: YP_002573725
Protein GI: 222529843
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase
TIGRFAM ID:
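
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1957841, 1959541)
print(length)                              # 1701
print(expected_protein_length_aa(length))  # 566
```

Both values agree with the Gene Length and Protein Length fields of the record.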


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000040084
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGGAA AAAATGTATT ATTATTACTC TCAATTCTGA TTGCCACAGG ATACTTTATT 
CTTCAGGTTA TTTCCATTGG CTCAAGAGAA ATACCAACAA CATATTGGGC ATCCAAAGAA
GAGGGCAGTA CAATTGAAGT GAGAATTTCA CCTAAGATGT TTATAAAAGA CTTCTTTATC
TATTTTGGTT ATGGAGACGG GAAAGTTGAT ATACAATTTT TGAACGATAA CTCGGTAATA
TTCCAACAAT CTGTCCAATC AGCCTTTTTT CAAGGTAAGG TAATATTAGT TAATAACGTC
GTTGACAAAG TGCTTATCGT GACACATGGA AATGGTCTTG AGATTAGAGA AATTGCAGCA
AAAAAAGATG ACAAACAGTT TATAGATTTT TCAAAAGCAG AAATTGTTAT TTTAAAAGGC
TTCAAATATT TAGGAAATCC TTATAATTTA GTTGATGAGC AAGCTAAAAT GAGAAGTATA
GTAAGTTACC GTTACAGTTC ATATTTTGAT GAGATATATC ATGCAAGAAC AGCCTACGAA
CTGTTAAAAG GGCTGCCACC TTATGATTTA GTTCATCCAC CCTTAGGTAA ATGGCTAATA
TCAGTTGGAA TGGCAGTATG GGGAGTGAAC CCATTTGGAT GGAGAATTGT TAATTTAATT
TTTGGTTCAA TTGCATTGGT TCTTATATTA ATTTTATTTA CTAAGCTACA CAAACCCTCA
TTTTGGTGCG GAATAGCAAT TATAATATTG ATGGCAAGTG ACTTTTTGCA TAATAGCTTG
TCGAGAACAG CAAACCTTGA TACATTCAGT TTGTTTTTTA TTCTTTTATG TTCAATTTTT
GGAATGTCTT ACATTAGTAG CATACTAAAA AAGAAAGAAA AATTATCTAA AACGAATTTA
GCATACTTTT TGACATTTTC AACTGGAGGG TTAGCATTTG CTTGTAAATG GAATGCTTTG
TATTCAATTA TACCTATTTT AACAATTTCA TTCTCATATA GAGTGCATAA TCTTATAAAA
AATAATGACA AAAATTGGGT TGTAAAGGTG ATCAAAAATG GATTATTATC CATCATAGCT
TTTTTAATAC CTTATTATCT TACATACCTT CCAATAACAA TAAAATACCC ATATCATAAT
TTACCAGGGG CAGTTATAAG TGATTTTATA ATGTTGCAAA ACCATATTTG GAAATACCAT
TCGACATTAG TAGCAACGCA TCCATTTTCT TCTGAGTGGT ATCAATGGTT ATTGGCAACG
AAACCACTAT GGGCTTATTA TGACAATTCG TTGCCAAGTA ATTTACGTTC GACTATTGCA
TATTTAGGGA ATCCAGTCAT ATGGGGTCTG GGGTTGTTAG CACTGGCATA TTTATTGATA
GTGGCTCTGA AAAATCCAAA AGACAATTTG AGTTCTTTTA TAGTAATTAC AAGCTATATA
TCTTCAATTG TTCCATGGAT GTTTATAGGT AGAATTAAGT TTATATATCA TTATTACCTC
GCATTACCTT GGCTTTACAT AGCGATAGCT ATGGCAATTG ATAATCTAAG ACTAAAGCAG
GGATTAAAAG AAAAAGTTGC AATGACAGTA AGTAGTTTAG CTCTGATTAT GCTTATCATA
TACTATCCTG CTGTGAGCGG TTTGACTGTG TCAGCAAAAT ACATTAATAT GCTAAAGATA
ATGAAGAGTT GGATATTCTA A
 
Protein sequence
MKGKNVLLLL SILIATGYFI LQVISIGSRE IPTTYWASKE EGSTIEVRIS PKMFIKDFFI 
YFGYGDGKVD IQFLNDNSVI FQQSVQSAFF QGKVILVNNV VDKVLIVTHG NGLEIREIAA
KKDDKQFIDF SKAEIVILKG FKYLGNPYNL VDEQAKMRSI VSYRYSSYFD EIYHARTAYE
LLKGLPPYDL VHPPLGKWLI SVGMAVWGVN PFGWRIVNLI FGSIALVLIL ILFTKLHKPS
FWCGIAIIIL MASDFLHNSL SRTANLDTFS LFFILLCSIF GMSYISSILK KKEKLSKTNL
AYFLTFSTGG LAFACKWNAL YSIIPILTIS FSYRVHNLIK NNDKNWVVKV IKNGLLSIIA
FLIPYYLTYL PITIKYPYHN LPGAVISDFI MLQNHIWKYH STLVATHPFS SEWYQWLLAT
KPLWAYYDNS LPSNLRSTIA YLGNPVIWGL GLLALAYLLI VALKNPKDNL SSFIVITSYI
SSIVPWMFIG RIKFIYHYYL ALPWLYIAIA MAIDNLRLKQ GLKEKVAMTV SSLALIMLII
YYPAVSGLTV SAKYINMLKI MKSWIF
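
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a translation and a GC-content calculation. It assumes the codon assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as printed above.
first_row = "ATGAAGGGAA AAAATGTATT ATTATTACTC TCAATTCTGA TTGCCACAGG ATACTTTATT"
print(translate(clean(first_row)))  # MKGKNVLLLLSILIATGYFI
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and then `gc_percent` gives a value that can be compared with the GC content field.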