Gene Athe_2673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2673 
Symbol 
ID7407037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2814319 
End bp2815281 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content37% 
IMG OID643717039 
Producthypothetical protein 
Protein accessionYP_002574508 
Protein GI222530626 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000356994 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGCA ATCTACCATC AAAGGAACAC GATTCAACTT TCAAGCTTTT GTTTGAAAAC 
CCAAAAGATA TCCTCTTTCT TGTCAGGGAT GTAATAGGCT ACAGTTGGGC AAAAGATATT
CAAGAAGACT CAATAGAACT TGCCGACAAA GAGTTTGTAG ATGAAAATTT CCAGCAAAGA
AGAGCAGACA TCATAGCAAA GGCAAGACTA AAAGAAAGGG AAGTATATTT TTACATTATA
ATTGAGAACC AGTCAACAGT AGCTGAAGAC ATGCCAGAAA GACTACTGAG ATACATAATC
CTTCTATGGG CAAAGAAGAT TAGAGAAGGT GTAAAGAAAC TTCCAGCAAT AATCCCAATA
GTCACATACA ACGGTCTGAA AGAGAAATGG GATGTGACAA AGGATATTAT TGAAGCTTTT
GAAATCTTCA AGGATAACAT ATTCAGATAT GAGCTGGTAG ACCTATCAAA GATAGATGCA
GAAGGACTTT TTGACAAGGA AGAGGATAGT CTTGTTCCAG TAGTGTTCTA TCTTGAACAG
GCAAGGAACG ACACAAAAGA ACTTGTAATA AGGTTAAACA AGATAAAATC TGTACTTGAG
AAAATGGGCA AATACAACAG AGAGAGATTT GCAATTCTTG TAAAGAACAT AGTAGAACCG
AGGCTAAACG AAAGACAAAG GATAGAGATT GAGAAGATAA CCAGAACTCT TTTGCAGGGG
GGAGAGAAAA TGGGTGAGTT TGTGTCAAAC ATTGCAAGAG TTTTAGATGA GTCACTGGAA
AGGAAGTTTA ACGAGGGAAT TCAACAAGGA ATTCAACAGG GAATTCAACA GGGAGAGTAC
AGGACAAAAG TTGAGATAGC TAAAAAGTTG ATATTAAAAG GAACAAGTGA TGAGGACATA
GCAGAAGTAA CAGAACTTGC TATTGAAGAA ATAAGAAAAC TGAGGAAAGA GCTTGCAAAT
TGA
 
Protein sequence
MNSNLPSKEH DSTFKLLFEN PKDILFLVRD VIGYSWAKDI QEDSIELADK EFVDENFQQR 
RADIIAKARL KEREVYFYII IENQSTVAED MPERLLRYII LLWAKKIREG VKKLPAIIPI
VTYNGLKEKW DVTKDIIEAF EIFKDNIFRY ELVDLSKIDA EGLFDKEEDS LVPVVFYLEQ
ARNDTKELVI RLNKIKSVLE KMGKYNRERF AILVKNIVEP RLNERQRIEI EKITRTLLQG
GEKMGEFVSN IARVLDESLE RKFNEGIQQG IQQGIQQGEY RTKVEIAKKL ILKGTSDEDI
AEVTELAIEE IRKLRKELAN