Gene Athe_2766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2766 
Symbol 
ID7408336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2916847 
End bp2917953 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content33% 
IMG OID643717122 
Productspore germination protein 
Protein accessionYP_002574591 
Protein GI222530709 
COG category 
COG ID 
TIGRFAM ID[TIGR00912] spore germination protein (amino acid permease) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000369492 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAATT CAAAAATATC ATATTCTCAG TTTTTGTGTC TATTTTTTGT AAACAGAATA 
GTGATGGTTC TTACATTCTT GCCAATATTC AATGCGCCAC CAAAAAACCA GGACGTATGG
CTTTCGGTTG TGTTCATGTA CCCATTTTCT ATCTTAGTAT CAATGCCACT TATTTATCTT
TGTTCTTTAT TTCCAAACCA TACATTTACA AATATCCTCA ATATTGTATT TGGAAGATTA
TCTATAGTTT TGTCTATCCT ATATGTCTGG TTTTTCTTTC ACATCACAGC AATACAGGTT
GCACAGTTTG TTGAGTTCAT GGCAACAGCT GTCATGCCAG AAACACCAAT TCTTTTTTTC
ATAGTGACAA TGTTGACTGT CTGTTTTTAT GCACTTAAAA AAGGGATAGA GCCTTTGGCA
AGATTTGGCC AGATAACAAC ATTTGCTACA ATCTTGAGTC TTGCAATTAT TATGATAATA
TCTTTAAGAT TTTTCGACCC TGCAGCTTTA AAACCCGTTT TAGAGAAAAA TATATCACAG
TCTATCTTTG GTGGAATATA CCTGTGTTCA CTTAGTTCTG AAATAATCAC AATTGGTATG
ATAAACCCGT ATATAGTCAA GAACAAAAAC TGTACAATCA AAGACATGAC AAAAACTATT
GCACTTGCTT TCTTGCTGGT TGACATATTT TATCTTGCAA TTACAATCAT AGTACTTTCT
CTTTTTGGAT ATACCCAGGC AAGCAGGCTC TCATTTCCAT TTTATTCTGC TATCAAAGTG
TTAAGTGTTG CAGAGTTTCT TGAAAGGTTT GAATCGCTTC ACATGGCAAT TTGGATAATG
GGCATATTTC TTAAAATCAC ATATTTCATG TACATTCTTC TTACTACAGT GCAGGAGCTG
AGAAACACTG CAGACCATTT TGCATATGCA ATTCCTTTTA CATCTGTGCT TGCACCCTTT
GTTTTTTATA TAATTCCTAA TTTTTTGTCC CTTGACAGGT TTATGAGCTA TAAGTATTTT
ACACTGTACT CATACATCTT CATATTTTTC ATACCGCTTT TTACCCTTAT ATTTGCAAAA
ATAAAGCTGA GGGCGAGAAA AAAATGA
 
Protein sequence
MSNSKISYSQ FLCLFFVNRI VMVLTFLPIF NAPPKNQDVW LSVVFMYPFS ILVSMPLIYL 
CSLFPNHTFT NILNIVFGRL SIVLSILYVW FFFHITAIQV AQFVEFMATA VMPETPILFF
IVTMLTVCFY ALKKGIEPLA RFGQITTFAT ILSLAIIMII SLRFFDPAAL KPVLEKNISQ
SIFGGIYLCS LSSEIITIGM INPYIVKNKN CTIKDMTKTI ALAFLLVDIF YLAITIIVLS
LFGYTQASRL SFPFYSAIKV LSVAEFLERF ESLHMAIWIM GIFLKITYFM YILLTTVQEL
RNTADHFAYA IPFTSVLAPF VFYIIPNFLS LDRFMSYKYF TLYSYIFIFF IPLFTLIFAK
IKLRARKK