Gene Athe_2742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2742 
Symbol 
ID7408312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2891654 
End bp2892967 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content33% 
IMG OID643717098 
Producthypothetical protein 
Protein accessionYP_002574567 
Protein GI222530685 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGGAT TCAATCCATT TGTGAATGGT TTTAATAAAT TAAGAAATTT TACAAGGACT 
GTATATCTGT ATGGATGTTA TTCAAGAGAA GATGCAGAGA ATTTTAACAT TGCAAAAAGG
ACATTTGATG ATGAGCTTCG AAGAATTAGA ATTTTCTTAG GGGAAGACCA GTATTTTTCA
GCAGAAAAAG AAGGGAAAAA ATCCCTGCCA TGCATTGTAG AAAACTTTTT CAGAGATGTT
GAGAATCCAC TTATAAACAT ATATTTTTCG AAAACTTCTA CTGCACTGCA AACAACCTTG
TTTTTTATGG TATTGCAGAT ATTAAACTTG TCAGAAAACA AAAAAGCAAC TTTTACTCAG
ATTTCAAACG AAATATCGCA GGTACTTGAT GAAGATGTTG CTGATGCTGG ATTTGAGTCC
AGCCTGAAAA GAGTCTTAAA ACAACTTCAA AATTTGGGTA TTGTGAAATA TTTAAAAAAT
GAAAAGGTGT ATATGCTATG TTCTCAAATA AAGGATGTAT TAAAAGATTT TTCAATAGAT
GAGATAAAAG ACATTTATAT ATCTATTTTA TTTTTTATAA ACACGAATGT TCCCAACGTT
CCGGGATGGT ACTTAAAAGA AAGTCTGGAA AAATACCTTT TAGAACTTGG CGAAGAAGAG
TTTTTAAAGG ATACAAACAG ACTATTTTGG TTTACATACG TTCCACACCA CTATATCCTT
GAAGAGGAAC TTGTATGGAA ATTTTTAGAG GCAGCATCAA ACAATAAAAA GATAAAGGTT
TGGTACTATC CACGCCAAAA AAGACATTTA TCAGATTTTT CATGCATACC AGTGAGAATA
ATTTATGATG TAAAGCTTGG AAGATGGTAT TTTATGGTAT TAAGGGGAGA AGATTTATCG
GCATTGCCAG TGTGGCGCAC AGAAAAGATA GAGATTTTGC AGGAAGATTT TGACCCGCAA
AAGATTTCAC CTTTTGTAAA AAAGATTGAA AAATGTTTTT TTGTATCTGT TCCGAACAAT
AAAAAAGGAT TTAAAAAGAT TAAGATTATG TTTAAATGCC CGCTGGATTC GCCGTACAAC
TTTGTGCTTG CAAGGGTGAA AAGAGAGCTA AAAAACGCAA GAATAACCAA AATTGATGAG
AGAACATTTG AAGTGGAGCA TGATATTAGC AATATAAAAG AGTTTAAGGG ATGGCTGAGA
AGTTTTGGTG AAAGAGCTGT TGTGCTTGAC GATACTGAAG CTGGAAGAGA ACTCAAAACA
GAAATGATAA ACGAATGGAA GGAGATCCTG AGAAACTATG GAGATTTTTA TTGA
 
Protein sequence
MSGFNPFVNG FNKLRNFTRT VYLYGCYSRE DAENFNIAKR TFDDELRRIR IFLGEDQYFS 
AEKEGKKSLP CIVENFFRDV ENPLINIYFS KTSTALQTTL FFMVLQILNL SENKKATFTQ
ISNEISQVLD EDVADAGFES SLKRVLKQLQ NLGIVKYLKN EKVYMLCSQI KDVLKDFSID
EIKDIYISIL FFINTNVPNV PGWYLKESLE KYLLELGEEE FLKDTNRLFW FTYVPHHYIL
EEELVWKFLE AASNNKKIKV WYYPRQKRHL SDFSCIPVRI IYDVKLGRWY FMVLRGEDLS
ALPVWRTEKI EILQEDFDPQ KISPFVKKIE KCFFVSVPNN KKGFKKIKIM FKCPLDSPYN
FVLARVKREL KNARITKIDE RTFEVEHDIS NIKEFKGWLR SFGERAVVLD DTEAGRELKT
EMINEWKEIL RNYGDFY