Gene Athe_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0009 
Symbol 
ID7407244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp9187 
End bp11097 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content32% 
IMG OID643714423 
Productsecreted protein 
Protein accessionYP_002571948 
Protein GI222528066 
COG category[R] General function prediction only 
COG ID[COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00303672 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA AACGTTTTTT AGGAATGATG GTAGCTTTAT TTATAATATT ATCTTCTGGT 
ATCAATTTTG TCATGTCAGA CCAGTCGAAA CCAAAAAGCT TTCTACAAAA AGTAGGAACT
TTTAGCAACT TTAAAAAACT TGTTGATGAA GCTCTGAAAA AAAATCCATA CATGGGATCA
GGGGAAAATA TAATGTATGA ATCAGCACCA GATTTTGTTA TAAAGGGTGA AAATAAAGAG
GGTTCTGAGT CTTTTTATTC ACAGACAAAT GTGCAGGTTA TGGGCGTTGA TGAAGCTGAT
ATTGCAAAGA CAGACGGGCA GTATATATAC ATTGCAAAAC CATATCCAAA GATAAAAGAT
AATGGAATTG TAATTGTTAA AGCTTATCCA CCTGAGGAAA TGAAAGTTTT ATCTAAAATT
AAATTGAGCG ATGAGTTCTA TCCAGAAGAA TTTTATGTTG ATGGCAGGTA TCTTGTTGTT
ATTTGTGAAA AAGAAAAGAT TGTAGATAGG GAACCTGTTA TTCAAAAAAA TAATTTTCCT
GATATCGATA CAAAGAAAAT AAAAGTTTAT GTGCCATATC AGTTATTGAT AGAGACGTAT
TGTATTGTTT ATGACATTTC TGACAAATCA AATCCTAAGG AAATAAGAAG GGTTTCAATT
TCGGGAAGAT ATCTGACATC AAGGAAAATA GGAACAAGTC TTTACATTGT GATAGAAAAG
CAATTCCCAT ATAGAATTTA CGCTAATAAA TCCTACTCTG AGCAAGATTT CAAACCATAC
TTTTCAGATA GTATTACAGG TAGTTCAAAA AAGATATATA TAAACTTTGA CAGAATAAAG
TATATTCCAG ATTTTATAAA CTGTAGCATT ACTGTTATTG GAAGTTTCGA TATTGAGTCA
AAAGATAAAA TTTGTGTTGA GTGTGTGCTT GGTGGAGGGA ATATAGTTTA CTGTTCGCAA
GAAAATCTTT ATTTGTGCTC TGAAGTAATA AAAAAAGTTT ACTGGCCTGA AAAATGGGAA
GATAATACAA GACCATGGTA TAGTTATGTA AGAAAAACAA TGATTAGCAG ATTTGAACTT
TCAAAAGGGA AAATTAATCT TGAGGCAGCA AGTGTTGTCA GTGGAAAAGT GTTGAATCAA
TTTTCAATGG ATGAACAAGA TGGTTATTTC AGAATTGCAA CAACTGGTGA AAGGCTTTAT
TTTCCAGAAA AAAATTACGA TTATTTTAAT GCTGTGTATG TTCTTGATAA AAACTTGAGA
GTAGTTGGGA AGATAGATAA TATTGCAAAA GGAGAAAGAA TATATTCAGC AAGATTTATT
GGAAAAAGAC TATATCTTGT TACCTTCAAA GAGCTTGACC CATTTTTTGT AATTGACCTT
GAAGACCCTC ATAATCCAAA AGTGCTTGGG TATCTTAAAA TTCCAGGATA CTCAACATAT
CTTCATCCAT ATGATGAAAA TCACATAATA GGATTTGGCA GGGATGCAGA GGATTTAAAT
GAAGAATATG CAATTCCTTT AGGACTGAAA ATTGCAATGT TCAATGTTGA GGATGTAAAA
AATCCAAAGG AGCTGTTCAA AATCATAATT GGGGGCAAGG GTACTTATTC TGAACTTCTT
AATAATCACA AAGCGCTTTT GTTTGATAAA AGTAAAAACA TTTTTGCATT CCCTGTAGAG
GTTTACGACA AAAAAGGGCA TAACTTCACA GGTGCTTTTG TATATAGCAT AGATTTGAAA
GAAGGTTTTG TTTTGAGGGG CAAGATTTTG CATGAAATTG GTGATGGATA TTGTGAGGAG
ATAGACAGGC TTTTGTATAT TGGTGATGTG CTCTACTCAG TTTCAAACTC AATGATAAAA
GCAAGCTCTC TTGAAAGCTT CAAGGAGATA GCAAGGTTGA GGTTGGATTG A
 
Protein sequence
MMKKRFLGMM VALFIILSSG INFVMSDQSK PKSFLQKVGT FSNFKKLVDE ALKKNPYMGS 
GENIMYESAP DFVIKGENKE GSESFYSQTN VQVMGVDEAD IAKTDGQYIY IAKPYPKIKD
NGIVIVKAYP PEEMKVLSKI KLSDEFYPEE FYVDGRYLVV ICEKEKIVDR EPVIQKNNFP
DIDTKKIKVY VPYQLLIETY CIVYDISDKS NPKEIRRVSI SGRYLTSRKI GTSLYIVIEK
QFPYRIYANK SYSEQDFKPY FSDSITGSSK KIYINFDRIK YIPDFINCSI TVIGSFDIES
KDKICVECVL GGGNIVYCSQ ENLYLCSEVI KKVYWPEKWE DNTRPWYSYV RKTMISRFEL
SKGKINLEAA SVVSGKVLNQ FSMDEQDGYF RIATTGERLY FPEKNYDYFN AVYVLDKNLR
VVGKIDNIAK GERIYSARFI GKRLYLVTFK ELDPFFVIDL EDPHNPKVLG YLKIPGYSTY
LHPYDENHII GFGRDAEDLN EEYAIPLGLK IAMFNVEDVK NPKELFKIII GGKGTYSELL
NNHKALLFDK SKNIFAFPVE VYDKKGHNFT GAFVYSIDLK EGFVLRGKIL HEIGDGYCEE
IDRLLYIGDV LYSVSNSMIK ASSLESFKEI ARLRLD