Gene Athe_1862 details

Gene Information

Locus tag: Athe_1862
Symbol:
ID: 7408975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1954763
End bp: 1956301
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 29%
IMG OID: 643716234
Product: KWG repeat protein
Protein accession: YP_002573723
Protein GI: 222529841
COG category:
COG ID:
TIGRFAM ID:
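
The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates (the page does not state the convention) and an unspliced CDS that ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1954763, 1956301)
print(length)                  # 1539
print(protein_length(length))  # 512
```

Both values agree with the Gene Length and Protein Length fields of the record.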


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.528379
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAATAGTA AAAAGAAGAA CAAAAGAAAA AAAAACAAGC AAAAAAAACA GCAAACGAAA 
GTTCAATTCA AAGAATTTTT GAGGTTGCCT ATTGTAAAAA GTTTTATAAT CGTATTAATT
TTGACCTTAT TGATAACGAC AATATATAGC ACAGCAACAA TTGTAATTAG AAAAAATATT
GCCAAACATT TTTTCACAAT CTCAGATGAA AATTATCAAA ATGTAGCATA TCTTAACAAT
AAATTGGTTA TGTTTGAAAA GAATAGTAAA TGGGGGATAA TGGATATACA AGGAAAAATA
GTAGTAAAGC CTACATATGA AAGGATACTA TTAGAAAGTG AAGGAATGAT TCCAGTTAAG
CTTAAAAATA AATGGGGCTT CATTGATATC TATGGAAAAG TTAAGATAAA ACCCCAATTT
GACAATGTAA CTGCATTCAC AGACCAGCTT GCAGCTGTGT GTGTGAACAA CAGGTGGGGA
ATTATAGATA AGTCAGGTAA ATATACTATA AAACCTCAGT ATCAAGACAT AATTATACAT
CCAAACAAAA TGATTCAAAC TAAAAAGTAT AATAAATGGG GAATTATTGA TGAGAAAGGA
AATCAAATTA TACCATATAA ATACAAAGAA ATACAAATTT TGAATTATAA GGGAGTCATT
GTTGCTAAAG AAAAAGATAG TTACAAAATA ATATCGATCA AAAATAAAAA GGAAAGTAAG
GAAAACTATA GCAACTTTTC ATTAAACATT GGGAAAATGT TACCTGTTAT GAGAAACAAT
AAGTGGAGCA TTTTAAATCT TGAAACGTTA TCAGAAGTTT TTCCGTTGAT TTATGATGAA
ATATGGGTCC ACAATGAAGG CTGGATAGAG CTTAAAAAAG ATAAAAAGTT GTACATTATG
TTTTCTGACG GTAAAATATT AGATAAAACT TTTAAAAGGG ATTCAGTAGT TGTTTCCTCA
AACAAAATGA TTTCAATAAA TCAAAATAAT GGAGTCATAA CTTTAGTAAA TCTTGATTCA
CGAAAGACAG TAGATATAAA AGCACATGAT ATAACAGTAT TTAACAATGG TTTTGCGGCA
GTCAAGGTTA GGGACAAGTG GGGGTTGATT GCAGAAAATG GGAAGTTTGT TATTAAGCCA
AAGTATGATT CTATTTGGAT AGCAGACAAA GACATAATTG TAGTTTATCT CAATGGGAAG
TGGGGGTTAG CGAAAATAAA CGGAGAAACT TTAACACCAC TGAACTATGA TTTAATAGGC
GAAGTAAAAG ATGGGTACGT AGCATTTCTG AAAAATGGCA AATGGGGTGT GATGCTAAAG
ACAGGCAAGA TACTGTTGAA ACCGAAATTT GATCAAATTA CACTTCATAC AAAGAACTAT
ATATTTGCTA GACAAAATGA TACATGGTTT CTCATAGTAA TTAAGAATAA TAAAAAATAT
TTCTATAGAT TCAAAACGTT CTCTGTGATA AGAGTTAATG ACAATATTTG GGCATATACT
ACAGAGAAAG GTATGAAAGT TATTATTCTA AAAAAATAG
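
A GC-content figure can be recomputed from a sequence formatted like the one above. This is a minimal sketch that assumes GC content means the share of G and C among the A/C/G/T characters, with the 10-base block spacing and line breaks ignored. The page does not say how its own value was derived, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped, so the sequence
    can be pasted in exactly as it is laid out on the page.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: two G, no C.
print(gc_percent("ATGAATAGTA"))  # 20.0
```

Passing the full gene sequence gives a value to compare with the GC content field.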
 
Protein sequence
MNSKKKNKRK KNKQKKQQTK VQFKEFLRLP IVKSFIIVLI LTLLITTIYS TATIVIRKNI 
AKHFFTISDE NYQNVAYLNN KLVMFEKNSK WGIMDIQGKI VVKPTYERIL LESEGMIPVK
LKNKWGFIDI YGKVKIKPQF DNVTAFTDQL AAVCVNNRWG IIDKSGKYTI KPQYQDIIIH
PNKMIQTKKY NKWGIIDEKG NQIIPYKYKE IQILNYKGVI VAKEKDSYKI ISIKNKKESK
ENYSNFSLNI GKMLPVMRNN KWSILNLETL SEVFPLIYDE IWVHNEGWIE LKKDKKLYIM
FSDGKILDKT FKRDSVVVSS NKMISINQNN GVITLVNLDS RKTVDIKAHD ITVFNNGFAA
VKVRDKWGLI AENGKFVIKP KYDSIWIADK DIIVVYLNGK WGLAKINGET LTPLNYDLIG
EVKDGYVAFL KNGKWGVMLK TGKILLKPKF DQITLHTKNY IFARQNDTWF LIVIKNNKKY
FYRFKTFSVI RVNDNIWAYT TEKGMKVIIL KK
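
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual compact TCAG-ordered encoding. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is acceptable for this gene because it begins with ATG. `translate` is an illustrative helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, then third base (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first. A codon containing anything other
    than A/C/G/T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAATAGTA AAAAGAAGAA CAAAAGAAAA"))  # MNSKKKNKRK
```

The output matches the first 10-residue block of the protein sequence.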