Gene Athe_1954 details


Gene Information

Locus tag: Athe_1954
Symbol:
ID: 7407368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2066856
End bp: 2068151
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 33%
IMG OID: 643716326
Product: germination protein, Ger(x)C family
Protein accession: YP_002573814
Protein GI: 222529932
COG category:
COG ID:
TIGRFAM ID: [TIGR02887] germination protein, Ger(x)C family
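
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following Python sketch is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2066856
END_BP = 2068151
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 2068151 - 2066856 + 1 = 1296 bp, and 1296 / 3 - 1 = 431 aa.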


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00684146
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACATA GCAAAAAAGC GGTATTAGGT ACTTTAATAA TATTTTCAAT ACTTTTTTTG 
TCAGGTTGCT GGGATAGGGT TGAGATAGAA GACAGAGGTT ATATTCTTGC ACTTGGTGTT
GACAAATATG ACCCAAGTGA TTTGAACAAG TACGAAACAA GTGAATATAT TAATCTTGAC
AGAAACACTC AAAAATTTTC TCCAGAACAA AAAAAGCCAG ATATAAAAAC TAACCAGAAG
GGTATTGACC CACAAACAAA AAGAAAGGTA AAACCCCCTC TTCCAAGTAG TAAAAATCAA
TACAAGTTTG CAGTGACAGT ACTTTTCCCA AACCTCAGAA CTATAGGAAA GGATTCAAAG
CAAGATGAAC AAATGAGATT TTTATTTGTA AGACCTACAA ATAATGTTGT TGGAATTAGA
AACTATCTTG AGAGAGAGAT AAACAAAAGG CTATATTATG GATATTTAAA AGTAGTTGTT
TTTGGAAAAG ACCTTGCTAA AGACCCTGCA CTTGTAAGAG AGGCTTTAGA TGGACTTAAT
AGAGAAAGTG ATATTCCACA GAACACATTT TTACTTGTTT CTGAAACAAC AGCAAGGGAT
ATCCTTAATA CCATGCCACT TGTCCAACCT GTAACTGGTA TTCATCTTTT TGAGATATCA
AAAAGCGCTT CAATTTATGG TAGGGTTATT GATACCCCAT TGAGTCAAGT TGTAAATAGT
TTTATAAATT CAAACTGTGC AGTTATCTCA AGAGTAGAGC CTGGAATTGA AACATTGAAA
ATAGCTGGTG CCGCTGTATT TAAAAACTTT AAATTTGTAG GGTGGATGAA TGAAAAGCAG
CTTCAAGTCT ATAAGCTTCT GATGGGAAAA GCTAAGCATA CATTTATTGA CGATTTAAAA
TACAAATCTT CCTATATTCC GTTTGTGACA ACAGAGATAC AGGTAAAGAA AAAAATAAAG
GAAGATAAAG GCAAATTAAG GGTTATATAC AACTTACGAA TAGAAGGTGA GGTTCCTGAA
TTTGTTTTCA AAAGTGATTA TAAAGTTTTG GATGATCCTA TGAGAAGGTA TATACAAACT
GAACTGAATA GAATAATAAA ACAAAGGTCA CAACAACTAT GTTATATGTT AAATGTAAAA
TACAATGCAG ATGTATTGGC AATAGGTGAT TTCATATCTA AACACAGACC AAAAGAATGG
GAAAAGCTAA AGAAAAACTG GGATACTGAG CTAAAGAAAA TTAAAATAGA AGTCATACCG
GATGTGAGAC TTAGAAGAAG TGGAACAGTA TTTTAA
 
Protein sequence
MKHSKKAVLG TLIIFSILFL SGCWDRVEIE DRGYILALGV DKYDPSDLNK YETSEYINLD 
RNTQKFSPEQ KKPDIKTNQK GIDPQTKRKV KPPLPSSKNQ YKFAVTVLFP NLRTIGKDSK
QDEQMRFLFV RPTNNVVGIR NYLEREINKR LYYGYLKVVV FGKDLAKDPA LVREALDGLN
RESDIPQNTF LLVSETTARD ILNTMPLVQP VTGIHLFEIS KSASIYGRVI DTPLSQVVNS
FINSNCAVIS RVEPGIETLK IAGAAVFKNF KFVGWMNEKQ LQVYKLLMGK AKHTFIDDLK
YKSSYIPFVT TEIQVKKKIK EDKGKLRVIY NLRIEGEVPE FVFKSDYKVL DDPMRRYIQT
ELNRIIKQRS QQLCYMLNVK YNADVLAIGD FISKHRPKEW EKLKKNWDTE LKKIKIEVIP
DVRLRRSGTV F
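
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any computation. The Python sketch below shows one way to strip the spacing, compute the GC fraction, and translate the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons), and it does not handle alternative start codons or ambiguous bases. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments, with bases in T, C, A, G order.
# Assumption: table 11 shares these amino acid assignments.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as displayed above.
    head = clean_sequence("ATGAAACATA GCAAAAAAGC GGTATTAGGT")
    print(translate(head))  # MKHSKKAVLG
    print(round(gc_fraction(head), 3))
```

Translating only the first 30 bases gives MKHSKKAVLG, which matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to the same functions would allow the reported protein length and GC content to be compared in the same way.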