Gene Athe_1241 details


Gene Information

Locus tag: Athe_1241
Symbol:
ID: 7409715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1330191
End bp: 1331381
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 29%
IMG OID: 643715606
Product: hypothetical protein
Protein accession: YP_002573114
Protein GI: 222529232
COG category:
COG ID:
TIGRFAM ID:
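
The coordinate and length fields above are related by simple arithmetic. The sketch below is an illustration only, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are made up for this example and do not come from the database.

```python
# Values copied from the record above.
start_bp = 1330191
end_bp = 1331381

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# that does not contribute an amino acid.
assert gene_length % 3 == 0
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1191 bp
print(protein_length, "aa")  # 396 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.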


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACCGGC TGAACAGTTA CTTGAAAACC AATAAGTATG AATTTTTGGT TGCTGCAGGT 
GTTGCTACTA CTGTGCAGTT TGTTAGGTAT TCGTATATTC CTGTATATTT ATTTTTGTTG
GCGTTATTTT TTATATCCTT TTATATCTAT AAACCAAAAA TAAGATTTGA TGCTTTGAAT
TTTGTTCCTA TGTTTTTTTA TGTAAGCCTT GCATTGATTT CTCTCTTGTT ACTAAATGTA
AGTTTAAACA AGGACAAGGC TATTATAGGT ATAATAAATG CTTTTGTATT CCCAGCATTG
TTTTATGTCT TTCTCATTTC ATGTAACGGA AATTTTATTT TGAAAATAGA AAAAATATGG
TTGTTTTTAT TAGCAATTGC TTCAGTTGTG TGTATTTTTG AATTTTTGTA CTATATAGCT
TTCAAGAGTT TGAGAGAGAG AACTATTTCA ATCTTTTTTA ATCCAAACAC ATTTGCGTTT
TTTTTAGTTA TGGTTTACCC ACTTGTGATA AACAAGTTGA AAGATGAAAA GTCAAAACTT
TTGGTATCGT TCTTAATATT TATAGAAATC TTACTTTCTG GTTCAAGGAC AGGGTTTGTA
GTATATATAT TCGAGTTTTT TCTTATAAAT ATTTACCTTA TTAGAAAAAA TATCTTAAAG
GTTTTCTTGG CAGTAGCTGG TATATTGACT ATTTTCCTTC CTAAGATTCT CTATAGAATT
CCAAGCTTAA GTGATGTAAC AAATCCTAAA ACGGCTGTTG GGCAGAGAGT TTTTGTGATT
GAGTTTGTTT TGAGATATTT TTCACACAGA AGCCTGTTTG AAGGAATTGG CGCAGGTCAA
TTTGAGCTAT TTTTTAGAAA GTTAAAAGCG CCTGGTTTAG TTGCCCTTCA CTCGGCACAT
AATTTGTTTT TAAATGCCCT TATTGAATAT GGTATAATAG GATATATGAT TTTAGTTTTT
ATAGTTTATT TTTCGGTTTT TCTTTCTGCA TATAATTTTT TTAAACACAA AGAAGAATAT
GATAGAAATA TTTTTATTGG ATTTATTCTT ATAACCATTT TTCAGATGTT TGATATGGCT
GAAATTACAA ATAGTAGGAT GCTATTAATT AACATGCTAT ATACATTTTA TCTTTTCTTG
CCTATTTACA GATTTAAAAG GTGGAGAGCT ATAGATGGAA AATACTTTTA G
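
A GC content figure such as the one in the gene information can be recomputed from a sequence printed in this 10-base block layout. The helper below is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases; `gc_content` is a hypothetical name, and only the first line of the gene sequence is used as sample input, so the printed value is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, pasted with its block spacing.
first_line = "ATGTACCGGC TGAACAGTTA CTTGAAAACC AATAAGTATG AATTTTTGGT TGCTGCAGGT"
print(f"{gc_content(first_line):.1f}% GC over {len(''.join(first_line.split()))} bases")
```

Passing the complete gene sequence instead of `first_line` gives the value to compare against the GC content field.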
 
Protein sequence
MYRLNSYLKT NKYEFLVAAG VATTVQFVRY SYIPVYLFLL ALFFISFYIY KPKIRFDALN 
FVPMFFYVSL ALISLLLLNV SLNKDKAIIG IINAFVFPAL FYVFLISCNG NFILKIEKIW
LFLLAIASVV CIFEFLYYIA FKSLRERTIS IFFNPNTFAF FLVMVYPLVI NKLKDEKSKL
LVSFLIFIEI LLSGSRTGFV VYIFEFFLIN IYLIRKNILK VFLAVAGILT IFLPKILYRI
PSLSDVTNPK TAVGQRVFVI EFVLRYFSHR SLFEGIGAGQ FELFFRKLKA PGLVALHSAH
NLFLNALIEY GIIGYMILVF IVYFSVFLSA YNFFKHKEEY DRNIFIGFIL ITIFQMFDMA
EITNSRMLLI NMLYTFYLFL PIYRFKRWRA IDGKYF
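
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical and written only for this example.

```python
from itertools import product

# Amino acids in NCBI order: codons enumerated with bases T, C, A, G
# at each position (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence above.
print(translate("ATGTACCGGC TGAACAGTTA CTTGAAAACC"))
# MYRLNSYLKT
print(translate("CCTATTTACA GATTTAAAAG GTGGAGAGCT ATAGATGGAA AATACTTTTA G"))
# PIYRFKRWRAIDGKYF
```

The two outputs match the first and last residues of the protein sequence listed above.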