Gene Athe_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2106 
Symbol 
ID7408815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2233160 
End bp2234242 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content38% 
IMG OID643716472 
Productprotein of unknown function DUF362 
Protein accessionYP_002573955 
Protein GI222530073 
COG category[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000135032 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACA ACATTTACAT TATCTACGGT AAAGATGCTA AATCTATGAC AAAACAGCTT 
CTTGAGTATG CCGATGTGAA AAGTTATATT CCTCAGGGGA GCAAAATTGC TATAAAACCC
AACTTGGTTG TTGCAAAGCC GTATACCTCA GGTGCTACAA CAAATCCACA TATTGTTGAA
GGAATCATAG AGTACTTGAG GGAAAATGGG TTTGAAAACA TTGCAATTTT AGAGGGTGCA
TGGCTTGGCG CTTCCACAAA AAGGGCGTTT GAAGTTTGTG GGTATACTGA GATTGCAAAA
AAATACGGTG TAAAGCTTAT TGACACAAAA GACGATAGGC CTTTGAAGAT AAATGTTGAT
GGGTTTGAGC TGAACATTTG TACCCAGGTC TATAGCTACG ACTTTTTAAT AAACGTTCCA
CTTTTGAAAG GGCACTGCCA GACACAACTT ACCTGTGCTT TGAAAAATCT CAAAGGGCTT
ATTCCTGACA GTGAAAAGAG AAGGTTTCAC ACACTTGGTC TTCACAAACC AATTGCATAC
TTGAACAAAG CAATAAAAAC GCATCTTGTA GTGGTAGACA GTATTATGCC AGACCCTGAC
TTTGAAGAGG GAGGAAATCC TGTTGAGAAG GATTTTATAG CCCTTGGCTT TGACCCAGTT
TTGATAGACA GCTTTGCCGC CGAGCATTTG GGTTACAACC CATATGACAT TGAATACATA
AGATTAGCAG AAAAATTAGG TGTTGGGAAA GCAGGTGAGT ATAATCTCAT AGAAATAAAT
TCTGATAAAA AACCTACAGG AGTTTCAAAA AGGTCTTCAA TTGTTTCAAG ATACACAAAA
TATATTGAAG AAAAAGACGC ATGTTCTGTA TGTTATGCAA ACCTCATAAG TGCTCTTATG
AGGTTAGACG AGCAGGGGGT TTTAAAAAGA CTTTCGAAAA AACTCTACAT TGGACAAGGC
TATAAAGGGA AAGTTATGGA TGGAATAGGA ATTGGGAGCT GTACAAGCGA TTTTAATATA
TGCAAACAGG GATGTCCTCC AAAGTCAAAC GAGATTGTTG AATTTTTAAA ACAGAATTTA
TAG
 
Protein sequence
MNNNIYIIYG KDAKSMTKQL LEYADVKSYI PQGSKIAIKP NLVVAKPYTS GATTNPHIVE 
GIIEYLRENG FENIAILEGA WLGASTKRAF EVCGYTEIAK KYGVKLIDTK DDRPLKINVD
GFELNICTQV YSYDFLINVP LLKGHCQTQL TCALKNLKGL IPDSEKRRFH TLGLHKPIAY
LNKAIKTHLV VVDSIMPDPD FEEGGNPVEK DFIALGFDPV LIDSFAAEHL GYNPYDIEYI
RLAEKLGVGK AGEYNLIEIN SDKKPTGVSK RSSIVSRYTK YIEEKDACSV CYANLISALM
RLDEQGVLKR LSKKLYIGQG YKGKVMDGIG IGSCTSDFNI CKQGCPPKSN EIVEFLKQNL