Gene Athe_1623 details

Gene Information

Locus tag: Athe_1623
Symbol:
ID: 7409453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1723565
End bp: 1724725
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 36%
IMG OID: 643715992
Product: glycosyl transferase group 1
Protein accession: YP_002573490
Protein GI: 222529608
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
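
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 1161 bp) and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1723565, 1724725)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```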


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0932889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTATAA AGCCAGTGTT TGTAAGCACA TATCCTCCGA GAGAATGTGG TATTGCCACT 
TTTACGCAGG ATTTGGTAAA TGCAATTGAA AAATATAATG ATGTAAGACG TTGTTATATA
ATTGCTCTAA GCAAAGACAA GCATATTTAT GACAATAGAG TGCTATATGA TATTAATCAA
GACAAATTTT CAAATTATGT AAAAGCAGCT AATCTTATTA ATATGTCGGA CATTGATGTT
GTGGTAATTG AGCATGAATA TGGAATATTT GGCGGAGAGG ATGGAGATTA TATAATCCCA
TTTGTGAAGC TTATCAAAAA ACCAATCATT ACAACATTCC ACACAGTTTT GAAAAACCCA
ACTGCAAAAC AATTTGAGAT ACTGAAAAAG TTGGCAGATG CAAGTTTCAA GGTCATAACA
ATGGCTAAAA CCACCAAGGA TATTCTCATG GAAGTGTATG ACATAGAAGA AGAAAAGATC
GAAATTGTTC ACCATGGTGT ACCATATATG GAGCTTGATG ATAGAGAGAC TTTGAAAGAA
AAGTATGGAT TAAAAGGTAG AAAAGTGATA TCCACGTTTG GACTTATTAG TCCTGGCAAG
GGTTTAGAGT ATGCGATTGA AGCAATGAAC AAGGTAAGAA AAGAATTTCC AGAGGCTGTG
TATCTGATTT TGGGTCAAAC ACATCCAAAT ATCAAAAGAA TAAAAGGCGA AGAGTATAGA
GAAAAACTTA TGAATATGGT AAAAGAATTA AAGTTAGAAA ACAATGTCCA GTTTGTTGAT
AAATATCTAA CAAAAGTAGA AATAATGGAA TATTTGCGCC TGAGCGACAT TTACCTGACT
CCTTACATAG GGAGGGAGCA GGCTGTATCA GGGACACTCG CATATGCTAT CGGTTCTGGC
AAAGCTATAG TTTCGACGCC TTATACTTAC GCTCAGGAAA TGCTCTCGGA TGGCAGAGGT
GTTCTTGTAG AGTTTGAAGA TGCACAGTCG ATTGCAGATG GAATATTGAT GCTTCTAAGA
GATGAAAATC TCAGGAAAGA GATTGAAAGA AAGACATTGG AGATTGGCAA AGAAATGTAC
TGGCACAATG TCGCAAAGAG AATGATAGAT ATATTCTATG ATGTTGTTGA AATAAACAAG
AAGGTAGGGG TGATAGCATG A
 
Protein sequence
MAIKPVFVST YPPRECGIAT FTQDLVNAIE KYNDVRRCYI IALSKDKHIY DNRVLYDINQ 
DKFSNYVKAA NLINMSDIDV VVIEHEYGIF GGEDGDYIIP FVKLIKKPII TTFHTVLKNP
TAKQFEILKK LADASFKVIT MAKTTKDILM EVYDIEEEKI EIVHHGVPYM ELDDRETLKE
KYGLKGRKVI STFGLISPGK GLEYAIEAMN KVRKEFPEAV YLILGQTHPN IKRIKGEEYR
EKLMNMVKEL KLENNVQFVD KYLTKVEIME YLRLSDIYLT PYIGREQAVS GTLAYAIGSG
KAIVSTPYTY AQEMLSDGRG VLVEFEDAQS IADGILMLLR DENLRKEIER KTLEIGKEMY
WHNVAKRMID IFYDVVEINK KVGVIA
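
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons), and it ignores the alternative start codon rule, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGGCTATAA AGCCAGTGTT TGTAAGCACA"))  # MAIKPVFVST
print(translate("AAGGTAGGGG TGATAGCATG A"))           # KVGVIA
```

The two fragments reproduce the first ten and last six residues of the protein sequence above; passing the full gene sequence to the same function should yield the complete protein.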