Gene Athe_1242 details


Gene Information

Locus tag: Athe_1242
Symbol:
ID: 7409716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1331365
End bp: 1332648
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 31%
IMG OID: 643715607
Product: hypothetical protein
Protein accession: YP_002573115
Protein GI: 222529233
COG category:
COG ID:
TIGRFAM ID:
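
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1331365, 1332648)
print(length, protein_length_aa(length))  # 1284 427
```

Both values agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields of this record.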


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.684679
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA CTTTTAGAGT TAGATTGAAG CAGCAGGTTA TTCCAGCTAT TTTGTACTTG 
ACCTTTTTGA GTGGTTTTTT TGGGAGTACT CTTGCGTATC CAAAGCTAAG TTACCTTTTT
GCATATAGAA TATTTTTGGC ATTTTTGTTT TTTCTCATTT TTATCGACAT AGTATTAAAT
GGAATTGAGC TGAAGAGTTT TCTTAACTTT TCGACTTTTT TTCTAATAGG GTGGTGTGCC
TACTCACTTT TAAGTTTTTT ATGGGCTCAG GATATAAAAA GTGCAGTGAG GGACCAGATT
TTTTTAACCG TCAATATATT TGTGATACTG ATATTTATGT ATTACTCAAA ATATCTTAGA
TGGAATATAA TTGAAAACAT AATATTAATT TCATTTATCA TCCATCTTGC TGTAGGCTAT
TTCGAAGTAA TTACTGACAA ACATTTGTGG ACATCTAAGG TACCTTTATA TAATCTTCAT
AGAACACCCT CAACCTTTTT TACAAATCCA AACGATTTTG CAACATATTT GGTTTTATAT
TTGCCATTTA TTTTAGCCGT TGCAGTAAAC AAGAAGAATA ATAATTTTTT CAGAAAATGG
ACAGCCTTTT TAGGCACAGT TTTGGTTATT CCTCTTTTAA TTCTTACAAC AAGTAGGGCA
AATTACATAG GATTTTTGAT AACTTTGATT ATTTATTTTC TTTTAACAGA TAAAGACCTG
AAAAAGAGTC TTCTACAATA TGGAGCTATA CTTTTAATTT TTTTAATGCT TATAATAGGT
TTTAGACTGG ATTTTGGAGC GTTTAATAAG GCAGTTGAAA TGATAAAAAT TCAGATTTCT
TCGCTTGCTG ATTTTTCGCA GACTTCTCTT TCCTCTAATG TACGGCGTGA GCTTTTGATT
GTGTATGGTC TTTCGTTTTT ATACGACTAC CTCTTTTTTG GTGTTGGTTC AGGCAACAGC
AGGGTTTTGA TGGAAAAGGT AAAACAGTAT ACTGTAAATG TTGAACTTCA TAATTGGTTT
TTGGATGTTC TTGTGTGTTA CGGCGTGGTA ATATTCATCT TGTATCTTAT TTGGATAGTC
TATATACTTT ACAATCTTTT TGAAATAAAA AAGAGCAGTA ATACTTTAAA CCTACCAACA
ATCCCTTTAA TAAGCTCTAT TTCTGCATTT TTTATATCAA GCATAAGTTC ATCGAAGATG
ATAGAGATGA GGGTAATGTG GTTTATATTT GCACTTTCGC TGTTTGTTTT AGTAAAGTCA
AAAGAAGAAA AAGGAGAGTC TTGA
 
Protein sequence
MENTFRVRLK QQVIPAILYL TFLSGFFGST LAYPKLSYLF AYRIFLAFLF FLIFIDIVLN 
GIELKSFLNF STFFLIGWCA YSLLSFLWAQ DIKSAVRDQI FLTVNIFVIL IFMYYSKYLR
WNIIENIILI SFIIHLAVGY FEVITDKHLW TSKVPLYNLH RTPSTFFTNP NDFATYLVLY
LPFILAVAVN KKNNNFFRKW TAFLGTVLVI PLLILTTSRA NYIGFLITLI IYFLLTDKDL
KKSLLQYGAI LLIFLMLIIG FRLDFGAFNK AVEMIKIQIS SLADFSQTSL SSNVRRELLI
VYGLSFLYDY LFFGVGSGNS RVLMEKVKQY TVNVELHNWF LDVLVCYGVV IFILYLIWIV
YILYNLFEIK KSSNTLNLPT IPLISSISAF FISSISSSKM IEMRVMWFIF ALSLFVLVKS
KEEKGES
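
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the Python below translates a DNA string and computes its GC fraction. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `translate` and `gc_fraction` are hypothetical helpers, and a codon containing an ambiguous base such as N would raise a KeyError.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def _clean(dna: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = _clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = _clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGAAAATA CTTTTAGAGT TAGATTGAAG"))  # MENTFRVRLK
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in this record, and `gc_fraction` on the same input can be compared with the GC content field.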