Gene Athe_2353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2353 
Symbol 
ID7407772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2494947 
End bp2496950 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content38% 
IMG OID643716717 
Producthypothetical protein 
Protein accessionYP_002574196 
Protein GI222530314 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAG AACTGGTGCT TGTCCGCAAC GAAACATTTA AAGACTTTCC CATTGGTAAT 
TTCCCATTTG ACCCTAATCA CTCTGCCATG GGTGAATATC ATTGTTATAT TCCTGAAGGT
TACCGTGGTA GATGGTACGA CCCTGTTGTA TATCATGGTT GGAACGGAAG TGGCCCTACC
TGGATTGTGA CAGAGGAAGA TGGTAAAAAG TTTATGGAAC AGACAAGGGT GCTCGCCGCC
TTAAAGAATT TGTGGCCAAT GCTTGTAACA GGAGATGAGT TTTGGCAAGA CTATATAGTA
GAGGCTCAAG TTAGGATGCT TTCAACCTTG CCTTCAGCCC ATGCAGGAAT GATGTTTAGA
TATGTACATG CACGCAGTTT TTATGCTCTT TTGCTTCATG ACAAAAAGCT CAAACTTATC
AAAAAAGACC AGGAAAATCT TGAAGTCTTG GCTTTCTGCA ATTTCAATTA TGATTGCGAT
ACATTTTATA ATCTAAAAGT GGAATGTTGT GGCAGCAAAC TAATTGGATA TGTAGACAAC
AAAAAGATGG TAGAAGCAAT TGATAGCTCA TATACAAGAG GAAAAATCGG GATTACAGCA
ACAATGCCTG CACAATTTAC AGATGTCAGG GTGTTGATGA ACCATCAAGC TTATAAAAAT
TATTTGATAG CAAAAAACAA TTGGGTTCAA GTTGAAAAAA ATCTTCAATG CGAGTATCCT
CAGCCGGTAT TATGGAAAAT AATTGATTTT AAAAACTTTG GTGCAGGCAG GCAAATAAGA
TTTGGAAATT TACCAGAAAA CAATCAGAAG TTTATGGTAA TAGCCCAGCA TCAAAAGCGG
GTGCACAGAG ATGCATACGC CAATATCAGC TGCCTTACCG CAATTGATTT AGATGGCAGA
ATTCTTTGGC AGATAGGTGA AGCATCTTCA CAAAAAGACC ATGCATATCT TACAGCTGAC
CTTCCTTTTC AGGTGTGCGA TGTTGACCTT GACGGGGTTG ATGAGGTAAT TGTTGCAAGA
AACTTTAAAA TCATGATTTT AGACTCTCTT ACAGGAAAAA TAAAAAAGCA AATTCCTGCG
CCCTTTTCTT ATGAAACTGA CAAACTGTAC TCTGTGCCAT TTGGCCAATA TGCCTTTGAT
AGGGTCAATA TTGACTCAAT CAGAATTGCA AACCTAACTG GCAAATCTAA GCCAACAGAT
ATAATTGTAA AAGATAGGTA CAGCAGGCTT TGGGCATATG ACAACAATCT TGATCTTCTG
TGGAAGTTCA ATGGCAAAAA CACAGGACAT TTTATGTTCA CCAAAGACAT TGATAATGAT
GGCAAAGAAG AGGTTGTTGT AGGTTATCAT CTTTTAGACC ATGATGGAAA GCTTATCTGG
ACTTTACCCA TTGAATCTGA CCACACTGAT GAGATTGTAA TAGGTCCAAT TGACCCTGAG
AGAAATGAAG ATATCATTGC GATGGCATGT GGTGATGAAG GCTTTATACT CTCTGACCTT
AAAGGCAATA TTATAAAGCG ACATTTAATA GGTCATGCTC AAAGAATAAG TGTTGGGAAC
TACCGACCTG ATCTTAAAGG CTATGAGATA TGCGTAACTA CATACTGGGG ATATCAAGGA
ATAATATACA TCTTTGATTG TAAAGGGAAT CTTCTGCATC AGTTCGAATC ACCAGTACCT
GGAAATATAA TAACACCTGT CAACTGGACT GGGGACGGAA AAGATTTAAT ACTTCTGAGC
GGTCATATTC AGTATGGTGG GCTGATTGAT GGTTTTGGAA GAAGGGTTGT AACTTTCCCT
GATGATGGTC ATCCAATCCT TTGTGCAGAT AGCATTGATT TGACAGGTGA TGGCAGAGAT
GAGATATTGC TGTGGGACGA AAAGAAAATG TATATTTATA CCCAGCAAGA CAATGGCATT
AAATCAGACG TGCTTGTTCC AAATAAATAC CCTACTATAA ATGGCTCAAA CTATAGAGGA
GAGTACTCTT TTTATAACAT TTAA
 
Protein sequence
MKEELVLVRN ETFKDFPIGN FPFDPNHSAM GEYHCYIPEG YRGRWYDPVV YHGWNGSGPT 
WIVTEEDGKK FMEQTRVLAA LKNLWPMLVT GDEFWQDYIV EAQVRMLSTL PSAHAGMMFR
YVHARSFYAL LLHDKKLKLI KKDQENLEVL AFCNFNYDCD TFYNLKVECC GSKLIGYVDN
KKMVEAIDSS YTRGKIGITA TMPAQFTDVR VLMNHQAYKN YLIAKNNWVQ VEKNLQCEYP
QPVLWKIIDF KNFGAGRQIR FGNLPENNQK FMVIAQHQKR VHRDAYANIS CLTAIDLDGR
ILWQIGEASS QKDHAYLTAD LPFQVCDVDL DGVDEVIVAR NFKIMILDSL TGKIKKQIPA
PFSYETDKLY SVPFGQYAFD RVNIDSIRIA NLTGKSKPTD IIVKDRYSRL WAYDNNLDLL
WKFNGKNTGH FMFTKDIDND GKEEVVVGYH LLDHDGKLIW TLPIESDHTD EIVIGPIDPE
RNEDIIAMAC GDEGFILSDL KGNIIKRHLI GHAQRISVGN YRPDLKGYEI CVTTYWGYQG
IIYIFDCKGN LLHQFESPVP GNIITPVNWT GDGKDLILLS GHIQYGGLID GFGRRVVTFP
DDGHPILCAD SIDLTGDGRD EILLWDEKKM YIYTQQDNGI KSDVLVPNKY PTINGSNYRG
EYSFYNI