Gene Athe_0177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0177 
Symbol 
ID7407168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp217118 
End bp219268 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content32% 
IMG OID643714579 
ProductNHL repeat containing protein 
Protein accessionYP_002572102 
Protein GI222528220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAA AGGTGAGGAC AGTGAAAGTT GAATTAAAAA GTATAAAGAG AAGTATAGTA 
CTCATTTTTG CCATAAGCCT AATATTCCAA ATAATAAGTA CTACAAAAGT GTACGCATAT
ATAATTGAAG ATATGCGTGA AGTTCCATAT TATCAATACA ATTATGATGT ATATCAGCAT
GAAATACCAA GCGCTGCTGG TTATTATCCA GCAAAAACCA TAAGAGGAGA ATATATCGGT
TCAGGCAGTT TTAAAAATCC AACAGATCTA TATGTTGATA ACAATAATAC TATCTATATT
ATGGACAGTG GCAACAAGCG AGTAATAGTC TGCGATAAAG ACTTCAAACT TAAGAAAATT
ATAGATAAAT TTTTCGATTC TAATGGAACT ATTCAACTTA TTGAACCAGA TGGAATATTT
GTCGATAAAA GTGGTTTAAT TTACATATGT GATAAGGGTT CTAAACTGGT CTGGATATTC
AACCAAGATG GCAAATTAAT AAGAACCATA GAAAAGCCTG TATCAGACCT GAAAGCTGCT
AAAAAAGATT TTATTCCATT TAAAGTTGTT GTTGATAATG CAGGGATTAT CTATGTTCTT
TCTTTGGGAA GTTTTGAAGG TGCTTATATG TTTGACCAAA ATGGGAATTT TTTGGGGTTT
TACGGGAGCA ACAAAGTAGT TGTTACATGG CAGTTATTGG TTGATAGGTT TTGGAAGAGT
ATTTTAACAA AAGAACAGAA AAGTTCTATG GTAAGATATT TACCAACGGA ATGTACCAGC
ATCGACATAG CTAAAGATGG ATTTATATAT ACCTGTTCAA ATTACACTGA TGTAAGCGAA
GGTGAAATAA GAAAATTAAA CTATTTAGGC GAAAACATAC TTTGGTATAA AAAGCAAGGC
AGAACAAGAG ACTATGGTGA TATTTCAAAA TATCAGGGGA AGGAGTTGGA AGATTCATAT
TTTATAGATA TTGACGTGAC TGATGATGGA TTTATCAATG CACTTGATTA TGAACGAGGA
CGAATTTTTC AGTATGATCA AAATGCGAAT TTGCTATTCA TATGTGGAGG GAAAGGTGAT
CAGGTAGGCA CATTTAAAGA TCCAGTGGCT ATAGATAGTA TAGCAAATGA TTTAGTTGTT
CTTGATAAGC TAAAAGGAAC AATCACAGTT TTTAAAGAAA CAAAATTAGG AAGCTTAGTA
CATAAAGCAA CGCTGCTATA TAATGAAGGT AAGTACGATG ATGCAAGGCA TTTATGGGAA
CAAGTTCACA AAATGGATTT CAATTTTTCA CTTGCACATG TAGGTCTTGG CAAAGCACTT
TTGAGGATGG ATAAATATTC AGATGCGATG TATTATTTCA GGCTTGCAAA TGATAAAGAT
GGTTATTCAG AGGCAAAAGA AGTTTTAAGA AATGAATTTT TAAAAAGAAA TTTTGGGGTT
ATAGCAACAA CTGTAATTGC TATAATAGTA CTTTTGTATG TATTAATAAA ACGCTTTAGA
AAGCCACCTA CAGCTCAAGA CATTTATACT AAAAAGATCG ACAAGTATAA ATATCCTTTA
CATGTAATGC TGCATCCATT TAGAGGCTTT GAAGAATTAA AAGAAGAGAG GAAAGGTTCT
GTTATAATTG CAACATTTAT TGTGTTTATA TTCTTTGTAA CTATGGTCAT AAATAGACAA
TACACAGGAT TTATCTTTAA TCCGTATAGA CAAGATAAAA TTAACATATT GTCAATATTC
TCAAGTACAG TTGGTATATT TTTCTTCTGG GTACTTTCTA ACTGGATGGT ATCCACTCTT
ACTGAGGGTG AAGGCAAGTT TGGAGAAATA TGGGTATTTT CGGCATATTC TCTGACACCA
TATATTATTT GTACTCTTTT AGCAGTTGTT ATGAGTCAAT TCATGATTGC AGAAGAAGCA
ATGTTTATAA ACTTTGTAAG ACTTATTGGC ACCTTATGGC TTGTAATATG CATATTTAAT
GCAATAAAAT CTGTTCATCA GTATACTCCA AGTAAAACTA TAGGCACAAT AGCTTTGAGT
GTTTTGGGAG TAGGAATAAT TCTTTTCATT ATTGTATTAA TGCTTACACT GTTTGGACAG
TTAATAGACT TTATCAATAA TGTGTACAGC GAGATACTCC TCAGGATATA G
 
Protein sequence
MKVKVRTVKV ELKSIKRSIV LIFAISLIFQ IISTTKVYAY IIEDMREVPY YQYNYDVYQH 
EIPSAAGYYP AKTIRGEYIG SGSFKNPTDL YVDNNNTIYI MDSGNKRVIV CDKDFKLKKI
IDKFFDSNGT IQLIEPDGIF VDKSGLIYIC DKGSKLVWIF NQDGKLIRTI EKPVSDLKAA
KKDFIPFKVV VDNAGIIYVL SLGSFEGAYM FDQNGNFLGF YGSNKVVVTW QLLVDRFWKS
ILTKEQKSSM VRYLPTECTS IDIAKDGFIY TCSNYTDVSE GEIRKLNYLG ENILWYKKQG
RTRDYGDISK YQGKELEDSY FIDIDVTDDG FINALDYERG RIFQYDQNAN LLFICGGKGD
QVGTFKDPVA IDSIANDLVV LDKLKGTITV FKETKLGSLV HKATLLYNEG KYDDARHLWE
QVHKMDFNFS LAHVGLGKAL LRMDKYSDAM YYFRLANDKD GYSEAKEVLR NEFLKRNFGV
IATTVIAIIV LLYVLIKRFR KPPTAQDIYT KKIDKYKYPL HVMLHPFRGF EELKEERKGS
VIIATFIVFI FFVTMVINRQ YTGFIFNPYR QDKINILSIF SSTVGIFFFW VLSNWMVSTL
TEGEGKFGEI WVFSAYSLTP YIICTLLAVV MSQFMIAEEA MFINFVRLIG TLWLVICIFN
AIKSVHQYTP SKTIGTIALS VLGVGIILFI IVLMLTLFGQ LIDFINNVYS EILLRI