Gene Athe_1343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1343 
Symbol 
ID7408924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1428993 
End bp1430138 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content32% 
IMG OID643715708 
ProductPHP domain protein 
Protein accessionYP_002573216 
Protein GI222529334 
COG category[S] Function unknown 
COG ID[COG1379] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00375] conserved hypothetical protein TIGR00375 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000142692 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGTAT ATGCCGATTT GCACGTCCAC ATTGGATTTT CTAATGGAAG GTATATAAAG 
GTACCTTCTT CAAAAACCCT TACACTGGAG AATATTATAA AAACAGCAAA AGATGAAAAA
GGACTTAATG TGATTGGTAT TGTCGATTTT TTTTGCAAGG ATGTAATTGA AGAAACTGAT
AAGCTATTAG ATAAAGGAAA GCTTGAATTG AAAGATGGAA GCTTGTATTC TGAAAGGCTT
TTGATAATAC CTGCAGCCGA GATTGAATTG AGGTTTTGTT CAAATGATTT TCACTGTCTT
GTCTTTTTCG AAGATTATGA AAAGTTAAAA GACTTTAGAA AAATAATTAA GACCTACTTT
AATCAAATTG ATTTTAGCTG CCCGGTCTTC AGAGGCCAAA TTAGTGAATT TGAAAAGATT
GTATCATCTT TTGGACTTTT AACTGTGCCT GCACATGCAT TTACACCATA TAAAGGTTTT
TATTCAGTCG CACAAAGAGT TGAGGAGGTT TTTAAGAAGA TAGAGGTTTT TTCAATAGAG
CTTGGTCTTT CTGCAGACTC AAAAATGGTA AGTTATCTTT GCGATGTCCA AAAAAGGAGT
CTTCTTTCTA ACTCAGACGC TCATTCTTTA AAAAATATTG CAAGAGAGTT TAACGAAATT
GAGGTTGAAA ATGTTTCTGC AAAAGATGTT ATAAAAAGTT TAAAGGAAAA CAGAATAAAG
GCAAACTATG GCATAAACCC AAAACTTGGA AAATATCATA AATCGTATTG CAATAGATGT
AACAGTTCAT TTAATTTAAA AATCCAAAAT GCAATATTAT GTCCATTTTG CAAAAGTAGA
GACATTGTAA TTGGTGTTGA GGACAGAATT TCTTGGCTTT GCAGGACAGA AAATCCTATT
GAAAAACCAC CTTATTTTTA CACATTCCCT TTTGAACTTG TAAAGGGATT TGGGCAAAAG
ACATTCCAAA AAATAATTGA TGTTTTTGGA AATGAGATAA ACTTTATCAA ATCTCTAAAT
AATGGTACTT TTAAGAATTA TAACATTGAT GAAAATGTGG CTCAAAAACT TGTAAGATTT
ATGAATCAAG ACTACACCGT TAAATTTGGT GCTGGAGGGC ATTTTGGAAG AATTATATTT
GAATAA
 
Protein sequence
MRVYADLHVH IGFSNGRYIK VPSSKTLTLE NIIKTAKDEK GLNVIGIVDF FCKDVIEETD 
KLLDKGKLEL KDGSLYSERL LIIPAAEIEL RFCSNDFHCL VFFEDYEKLK DFRKIIKTYF
NQIDFSCPVF RGQISEFEKI VSSFGLLTVP AHAFTPYKGF YSVAQRVEEV FKKIEVFSIE
LGLSADSKMV SYLCDVQKRS LLSNSDAHSL KNIAREFNEI EVENVSAKDV IKSLKENRIK
ANYGINPKLG KYHKSYCNRC NSSFNLKIQN AILCPFCKSR DIVIGVEDRI SWLCRTENPI
EKPPYFYTFP FELVKGFGQK TFQKIIDVFG NEINFIKSLN NGTFKNYNID ENVAQKLVRF
MNQDYTVKFG AGGHFGRIIF E