Gene Athe_2242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2242 
Symbol 
ID7407661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2377495 
End bp2379453 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content40% 
IMG OID643716608 
Productprotein of unknown function DUF1680 
Protein accessionYP_002574087 
Protein GI222530205 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATA AAGATTTTTT AGTATATTTG GATTCTCCCA GAATAAAAGA TATTTCAATC 
ACAGAACCTT TTTGGGGAAA ATATGTAGAC CTTATAAAAG ATGTTGTAGT TCCATATCAA
TGGGAGATAC TCAATGATAA TGTTGATATT CCCGTCAAAA GCCATGCAAT AAAGAATTTC
AAGATAGCAG CAGGACTTGA GGAAGGCGAA TTTGAAGGGT TTGTTTTTCA AGATAGCGAT
GTTGCAAAGT GGTTAGAAGC AGCATCATAT ATTCTTGAAA AGTATCCAAA TCCTGATTTG
GAGAAAAAGG TTGATGAGGT CATTGATATA ATTGAAAAGG CTCAATGGGA AGATGGGTAT
TTGAATACAT ATTTTACCAT CAAAGAAAAG GGTAAAAGGT GGACCAATTT AGAAGAGTGT
CATGAGCTTT ACACTGCAGG GCACATGATA GAAGCTGGTG TTGCGCATTT TCTGGCAACT
GGAAAAACAA GTCTTTTGGA GATTATAAAA AAGCTTGCAG ACCACGTATA CAGCATTTTT
GGCAAAGAAG AAGGTAAAAT CCCTGGGTAT GATGGTCATC CGGAAATTGA ACTTGCGCTT
GTAAAGCTCT ATGAAGTGAC AGGGGACAGA AAATATTTAG AGCTTGCGAA GTTTTTTATT
GATGAAAGAG GTCAAGAGCC GTATTACTTT GACATCGAAT GGGAGAAAAG AGGTAGAAAA
GAGCACTGGC AAGGATTCAA AAGGCTTGGC AGAGAGTATT TGCAGGTCTA CAGGCCTGTT
CGGCAGCAGA AAGAGGCTGT TGGTCATGCA GTCAGAGCAG TTTATCTTTA TTCTGGTATG
GCAGATGTTG CAGCATATAC ACAGGATAAA GAGCTTTTTG ATGTGTGCAA GACTCTCTTT
GATGACATAG TCAAGAGGAA GATGTATATC ACCGGTGCAA TTGGATCATC TGCTCATGGT
GAGGCATTTA CATTTGAGTA TGATTTGCCA AATGATACAG CGTATGCTGA GACTTGTGCA
TCTGTAGGTC TTATCTTTTT TGCACATCGC TTAAACAAAA TAGAGCCGCA TGCTAAGTAT
TACGATGTTG TAGAAAGGGC TCTTTATAAC ACCGTTATTG GTTCTATGTC GCAGGATGGT
AAAAAGTATT TTTATGTGAA TCCTCTTGAG GTATATCCGA AAGAAGTGGA AAAGAGGTTT
GACAGACACC ACGTAAAACC CGAACGTCAA CCTTGGTTTG GATGTGCGTG CTGTCCGCCA
AATGTTGCAA GGCTTTTAGC GTCTTTGGGA AGGTATGTTT ATAGCTACAA CCATGATGGG
ATTTACGTGA ATTTATACAT TGGCAGCAGT GTCCAGGTTG AAGTAGGTGG TATTAAGGTT
TTGCTCCAGC AAGTGTCAAG TTATCCTTTT GAAGACATGG TCAAGATAGA TTTAAAACCT
TCAAAAGAAG CAAGATTTAA GCTTTATCTT AGGATTCCAG GTTGGTGTGA AAGCTATGAG
GTTTATGTAA ATGGGAAGAA AGAAGAGCCA GAAGAACCGC CCAGCGGCTA TGTTTGCATT
GAGAGGTTGT GGAAAGAAAA TGACCAAGTT GTATTAAAGA TACCAACAGA GGTTAAAATG
GTAAGTTCAC ACCCGCAAGT GAGGAGCAAT GTAGGTAAAG TGGCAGTTGT GAAAGGCCCT
GTTGTATTTT GTGCAGAAGA AGCAGACAAT GGCAAGAATT TGCATCTGCT TTTTGTTGAT
GTAAACAGCA AGTGCAAGTT AGAATTCGAT AGCAATATTT TAGGAGGTTT GTACACAGTT
GAAGTAGATG GTTTTAGGAT GGCGGAAGAT GATTTTGGAG AAGAGCTTTA CAAGAGCCAC
AGACCGAAGT TTGTTCCTGT AAGAATAAAG CTCATTCCTT ATTATGCTTG GGCAAATAGG
GGAGTTGGCG AGATGAAGGT GTGGCTATTT GGCAAGTAA
 
Protein sequence
MSDKDFLVYL DSPRIKDISI TEPFWGKYVD LIKDVVVPYQ WEILNDNVDI PVKSHAIKNF 
KIAAGLEEGE FEGFVFQDSD VAKWLEAASY ILEKYPNPDL EKKVDEVIDI IEKAQWEDGY
LNTYFTIKEK GKRWTNLEEC HELYTAGHMI EAGVAHFLAT GKTSLLEIIK KLADHVYSIF
GKEEGKIPGY DGHPEIELAL VKLYEVTGDR KYLELAKFFI DERGQEPYYF DIEWEKRGRK
EHWQGFKRLG REYLQVYRPV RQQKEAVGHA VRAVYLYSGM ADVAAYTQDK ELFDVCKTLF
DDIVKRKMYI TGAIGSSAHG EAFTFEYDLP NDTAYAETCA SVGLIFFAHR LNKIEPHAKY
YDVVERALYN TVIGSMSQDG KKYFYVNPLE VYPKEVEKRF DRHHVKPERQ PWFGCACCPP
NVARLLASLG RYVYSYNHDG IYVNLYIGSS VQVEVGGIKV LLQQVSSYPF EDMVKIDLKP
SKEARFKLYL RIPGWCESYE VYVNGKKEEP EEPPSGYVCI ERLWKENDQV VLKIPTEVKM
VSSHPQVRSN VGKVAVVKGP VVFCAEEADN GKNLHLLFVD VNSKCKLEFD SNILGGLYTV
EVDGFRMAED DFGEELYKSH RPKFVPVRIK LIPYYAWANR GVGEMKVWLF GK