Gene Athe_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2124 
Symbol 
ID7408833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2257761 
End bp2259035 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content32% 
IMG OID643716489 
Producthypothetical protein 
Protein accessionYP_002573972 
Protein GI222530090 
COG category[S] Function unknown 
COG ID[COG4198] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000234804 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAACA TCAAAGGTTT TAGAGGTTTG AGATATTCGC CTGAAATTGA ACTTGACAAG 
TGCATATGTC CTCCTTATGA TATTATTTCT GAAGAGGAAA GAGAGGAACT CTACAGAAAA
AGTGAATACA ACATAATCCG AATTGAGTTT GGGAAAGAGT ATGAGAATGA TAACCAGGAA
AATAACAAGT TTACCCGTGC AAAAGAATAT TTGGATAACT GGATAAAACA GGGAATTTTA
AAATTTGACT CAAATGATTG CATTTATGTT CTTGAACAGG AATTTGAGGT GGATGGAAAG
AAATATGTTC GCACAGGACT GATTGTGGTT CTTGAACTTG TTGAATTTGA AAAGGGTATA
GTTATACCTC ATGAATTTAC ACTTTCAAAA CCTAAGCAAG AAAGACTTGA GCTTATGCGA
AAGACACATG CTAATATCAG CAGCATATTT GGGCTTTATG AAGATAAGAC AAAAGAAATT
AAAGAGATTC TGGATAGAAT AAAATCGAAA AAAGAAGATG TTTCTTACAA TGGTATTGGC
ACATTTGAAA GAATGTGGGT TGTATCTGAC AGTGATACAA TTGAAAAATT GAGGCAGCTT
TTTTATGATA AGAAGATATT TATTGCAGAT GGTCACCACA GGTATGAAAC AGCCCTTGAA
TACAAAAAAG AGATGGAACA AAAAGAGTGC AAACGAGATG ATGCAGATTA TAACTACATA
ATGATTACAC TTACAAGCTT AGAAGACCCG GGAATAGTTA TTTTACCCAC ACACAGGATT
GTTCTTTCAA CAGATATTGA AGAAGATATA TTCGTGGAAA AGTTAAAAGT AGATTTTGAG
GTTGAGCAAG GAGATTACAA AAGATTAAAA GAGAAGTTAG AAATAAAGAA GAAATATGCA
TTTTTGGTTT ACACATATAA CCATAACTTT TACCTAATAA CTTTAAAAGA ACCTGAAAAC
TCCTTAAGAG AAATTGAGGG AAGTAAAGCT TATAAAAATC TTGATGTTGT GATATTGCAA
AAACTTGTTT TAAATAAAGT ACTTGAAATT ACAGATGAGC ACATTCTTTA TCAAAGAAAT
ATAAAGTATA CCAAGTCTGA TAAGGAACTA ATAGAAACAG TAAACAAAGG CGCCAAATAC
GGTTTTATTT TGAATCCAAC CCTTGTTGAG GAATTAAAAG ATGTATCTTT AAGTGGCGAA
AAAATGCCTC AAAAGTCCAC ATACTTTTAT CCAAAACTTA TGACAGGAAA TGTAATGTAT
GTTCATTTAA AGTAA
 
Protein sequence
MANIKGFRGL RYSPEIELDK CICPPYDIIS EEEREELYRK SEYNIIRIEF GKEYENDNQE 
NNKFTRAKEY LDNWIKQGIL KFDSNDCIYV LEQEFEVDGK KYVRTGLIVV LELVEFEKGI
VIPHEFTLSK PKQERLELMR KTHANISSIF GLYEDKTKEI KEILDRIKSK KEDVSYNGIG
TFERMWVVSD SDTIEKLRQL FYDKKIFIAD GHHRYETALE YKKEMEQKEC KRDDADYNYI
MITLTSLEDP GIVILPTHRI VLSTDIEEDI FVEKLKVDFE VEQGDYKRLK EKLEIKKKYA
FLVYTYNHNF YLITLKEPEN SLREIEGSKA YKNLDVVILQ KLVLNKVLEI TDEHILYQRN
IKYTKSDKEL IETVNKGAKY GFILNPTLVE ELKDVSLSGE KMPQKSTYFY PKLMTGNVMY
VHLK