Gene Athe_2767 details

Gene Information

Locus tag: Athe_2767
Symbol:
ID: 7408337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2917950
End bp: 2919074
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 32%
IMG OID: 643717123
Product: germination protein, Ger(x)C family
Protein accession: YP_002574592
Protein GI: 222530710
COG category:
COG ID:
TIGRFAM ID: [TIGR02887] germination protein, Ger(x)C family
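
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2917950
END_BP = 2919074
GENE_LENGTH_BP = 1125
PROTEIN_LENGTH_AA = 374


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the reported gene length (1125 bp) and protein length (374 aa).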


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000418614
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGT TTGTATTTTA TTTAACACTG ATTTTATTAA CATTAGCCTT TCTGACAGGT 
TGCTGGAATA GAAAAGAACT CAACGATATA TTGATTGTTC AGGCAGTTGG AATTGACAAA
AACCAAAGTG GACAGTTCAA ACTCACATAT CAGGTATTAA AACCAAAGGT GCTCAAAACT
CCGACAAATA TACCTTCCAG TTTTCAGCAA AAAGGAGTGT GGTGTTTTTC TTCAACAGGC
AAAAGCATAT TTGATGCTAT TCGAAATACA ACACTTTCGT CAGACAAAAA ACTTTTCTTT
TCACACAACA AGATTATTGT TATTAGCGAA AAAGTTGCAC ATCAAGGAAT TGAAAATGTT
CTTGACATAT TCTTAAGATA TCATGAATTT CGTCCAGATG CGTATCTAAT TGTTACTTCA
GATGACATCG AAAAATTCTT AAATTCAAAT GTCCCAATAG AGTCAATACC TGCAAAAGAA
CTTGAAAATG TAATAAAAAA TTATTTTGCA AACTCAAAAA CGCTTCCTAT TAGCATATAT
GAGTTTCAAA AGATGTCAAA TACTAAGTCA AAAACTGCAC CTGTCCCATT TGTGACAATA
AAATCACCAT TAAAACAATC TTCACAATCC CAGATGTTTT ATGTAGAAAA AATGGCTGTG
TTTAGCAACT ATAAACTAGT AGGGTATCTT ACACACGAGG AGTTAAGAGG ACTTTTGTGG
GCAGCTGGTA AAATAAAAAG AGGGATATAT CCTATAAAGC TTGGAAAAGA CATATTTTCT
TTAGAACTTA TTCAAAGCAG GAGCAACATC AATGTTAAAA GAAAACTTGG CAAAGCTTTT
TTCATCCTGC AAATAACCAC AGAGACAAAC CTTGGTGAAA AATATTCAAA CTCTTATATC
TCAAGTTCAT TGGTTGAAAA AACAAAAAAA GAGTTTAGTA AATCTATCAG AAACGATGTT
CAGAAGGTAT TGAAAAAATC GTTTGAACTC AATTGCGATA TTCTGCACCT TGGCGATATT
TATTACTCTT CATACAAAAA GCCTTTAAAT TTTGACAAAA ATTCTATTTC GGTCTCAATT
GTTGTAAAGC CTTTCATAAG GCGATTTGGT ATGATGAAAG AATGA
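
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied in the ten-base blocks shown above. The helper name `gc_fraction` is hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence; paste the full sequence to
    # compare against the GC content reported in the record.
    first_row = (
        "ATGAAAAAGT TTGTATTTTA TTTAACACTG "
        "ATTTTATTAA CATTAGCCTT TCTGACAGGT"
    )
    print(f"{gc_fraction(first_row):.0%}")
```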
 
Protein sequence
MKKFVFYLTL ILLTLAFLTG CWNRKELNDI LIVQAVGIDK NQSGQFKLTY QVLKPKVLKT 
PTNIPSSFQQ KGVWCFSSTG KSIFDAIRNT TLSSDKKLFF SHNKIIVISE KVAHQGIENV
LDIFLRYHEF RPDAYLIVTS DDIEKFLNSN VPIESIPAKE LENVIKNYFA NSKTLPISIY
EFQKMSNTKS KTAPVPFVTI KSPLKQSSQS QMFYVEKMAV FSNYKLVGYL THEELRGLLW
AAGKIKRGIY PIKLGKDIFS LELIQSRSNI NVKRKLGKAF FILQITTETN LGEKYSNSYI
SSSLVEKTKK EFSKSIRNDV QKVLKKSFEL NCDILHLGDI YYSSYKKPLN FDKNSISVSI
VVKPFIRRFG MMKE
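
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own pipeline. It relies on NCBI translation table 11 assigning the same amino acids to codons as the standard table (the two differ in which start codons are allowed). The `translate` helper is hypothetical, and it assumes the input starts in frame.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First row of the gene sequence from the record.
    first_row = (
        "ATGAAAAAGT TTGTATTTTA TTTAACACTG "
        "ATTTTATTAA CATTAGCCTT TCTGACAGGT"
    )
    print(translate(first_row))
```

Translating the first row of the gene sequence gives the first 20 residues of the protein sequence (MKKFVFYLTL ILLTLAFLTG). The same function can be applied to the full sequence.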