Gene Athe_0454 details


Gene Information

Locus tag: Athe_0454
Symbol:
ID: 7407532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 516553
End bp: 517818
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 38%
IMG OID: 643714842
Product: hypothetical protein
Protein accession: YP_002572359
Protein GI: 222528477
COG category:
COG ID:
TIGRFAM ID:
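
The coordinate and length fields above are consistent with one another, and the relationship can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of this length.

    Assumes the CDS ends with a single stop codon, which is not
    counted in the protein length.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(516553, 517818)
print(length)                  # 1266
print(protein_length(length))  # 421
```

Both values match the Gene Length and Protein Length fields of the record.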


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTCAACC CATTTGAAGA AAAGCCAAAG CCGATTGAAA ATTTTCTCAT GGACTGGAAA 
ACAATATATC CCAAGTCATA CTGCAAAAAT GAAGTTGACC CTTATACAAA AACAAGAATA
ATCCTCATGA ACGGGATTGA AGCTGAAGCC TCGATGTTTT CCCATCAGTT CCATCGTCAC
TGCACAGACA ACGACGTAAG ACGCGATTTA GCAATGATGA GAAGAATTGA GCAGATGCAA
CAAAAACACA TTAATTGGTT AAAGCCAATC GACGAGACAC CTCTTGAAAC TACAATAGGA
TATGAGCATG TAGCAGTAGA CCTTACCGCT TGGCTTGCTC AAAACGAGCC TGACCCTTAT
GTTAAACAAA CTCTTGATTT TATTTTGCTT GAAGATTTTG ACCATCTTTA CCGCTATGCA
AACCTTCTTG ACATGGACTC GAATATCCCA GCAAACAATC TGGTAAAAAA CTATGTTGAA
ATCATCCCTG GTCGGCCGAC AATTGCCCAT CACAGACACC CTTATGATAC TGTCAGCAGA
CACATTGATT TTAAAAAAGC AGATATTAGA ACAAAGTTAA ACATCTTCAT CATAACAGCC
GGTGAGCAGC AGACAATGAA TTTTTATAAC AACATTGGCA ACACTTATTA CAACAACCTT
GGAAGACAGC TCTACTTAGA AATAGCCTTG GTGGAAGAAG AACATGTAAC CCAGTATGGT
TCGTTGATTG ACCCGAGACT CACATGGTTT GAAAGTCTTC TTTTGCATGA ATACACAGAA
TGCTATCTGT ATTACTCATT CTATGAATCT GAGGTTGACT CAAATGTAAA ATCTATATGG
GAGATGCATC TTGATGCTGA AATTGCCCAT CTTCACAAAG CTGCAGAACT TTTGCAAAAA
CATGAGAATA AATCTTGGTG CGAGGTCATC CCTGGTGGAC AGTTCCCAAA TCTTTTGCTG
TTCCATGACA CGCGTGAATA TGTACGCAAA GTTCTTTCAC AGATGGTTGA AATTACAGCT
GACAAAGAAG ATTTAAAAAA TGTAAATGAT CTACCAGAAA ATCATACGTT TTTCTGGTTC
CAGAATAAAG TGAACCATGA TATAAATTCC GTTGCAAGCC ATGTGGTGAT TGATAAGCAC
AACACTTTAA AAGGCGAAGA TTACAGAGTT GAATTGGCGC CACACCCAGT TGAAAGCTTG
AGAAATAGAA AAGAAGATAA TACTTCAATT GGCAGGTCAA AACAGAAAAA AGTTGTTAGT
ATATAA
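
The GC content reported in the gene information can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between the 10-base groups. `gc_content` is a hypothetical helper name, and how the database rounds the percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) used for display grouping is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste the full sequence
# to compare against the reported GC content.
first_line = "TTGTTCAACC CATTTGAAGA AAAGCCAAAG CCGATTGAAA ATTTTCTCAT GGACTGGAAA"
print(round(gc_content(first_line), 1))
```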
 
Protein sequence
MFNPFEEKPK PIENFLMDWK TIYPKSYCKN EVDPYTKTRI ILMNGIEAEA SMFSHQFHRH 
CTDNDVRRDL AMMRRIEQMQ QKHINWLKPI DETPLETTIG YEHVAVDLTA WLAQNEPDPY
VKQTLDFILL EDFDHLYRYA NLLDMDSNIP ANNLVKNYVE IIPGRPTIAH HRHPYDTVSR
HIDFKKADIR TKLNIFIITA GEQQTMNFYN NIGNTYYNNL GRQLYLEIAL VEEEHVTQYG
SLIDPRLTWF ESLLLHEYTE CYLYYSFYES EVDSNVKSIW EMHLDAEIAH LHKAAELLQK
HENKSWCEVI PGGQFPNLLL FHDTREYVRK VLSQMVEITA DKEDLKNVND LPENHTFFWF
QNKVNHDINS VASHVVIDKH NTLKGEDYRV ELAPHPVESL RNRKEDNTSI GRSKQKKVVS
I
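
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted initiation codons, and an initiation codon is translated as methionine. The sketch below illustrates this with a hand-built codon table. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative, and the code is not the method the database used to derive the protein sequence.

```python
BASES = "TCAG"
# Amino acids for table 11 in TCAG codon order (same assignments as the
# standard code; only the set of start codons differs).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading the initiation codon as M.

    Whitespace used for display grouping is ignored. Translation stops
    at the first stop codon, which is not included in the result.
    """
    dna = "".join(sequence.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator is always methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTTCAACC CATTTGAAGA AAAGCCAAAG"))  # MFNPFEEKPK
```

The output matches the first ten residues of the protein sequence.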