Gene Athe_0140 details


Gene Information

Locus tag: Athe_0140
Symbol:
ID: 7408502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 176012
End bp: 177859
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 33%
IMG OID: 643714545
Product: protein of unknown function DUF324
Protein accession: YP_002572068
Protein GI: 222528186
COG category:
COG ID:
TIGRFAM ID:
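
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 176012
end_bp = 177859

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in one
# stop codon that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # 1848 616 615
```

Under these assumptions the computed values match the reported 1848 bp and 615 aa.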


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAA GAAACACAGA AGCAAAACCT AACCAACGAA TGGCCGCCAA AGCACCTTAT 
AATTTTGTTC CTTTGAACAT ATGTGTAGTT AAAGCCCAAG AAGTGCCATC TTTTAGCAAA
TTTTATAAAG ATAGATACTC AGGATATATA GAACTTGAAA TAGAGACAAT AACACCACTT
TTTATAGGAA CAAAGGAAAA GTGCTCACAA TTTTTTTCGC CTGCTGGCAA ACCCCGAATT
CCTGGAAGTA CATTGAGAGG TATGACAAGA ACTCTTGTAG AAATTGTATC GTATGGGAAG
TATGGGTTTT GCGATAAAAG TAGAAGATTA TATTACAGAG CAGTTGCAGG TAGTTCAAGT
TTAGATAAAA GCTATAGAGA ATTATTTGTG GACAGAAATG ATTATTTTAA ATATAAGTTT
AGTGCAGGAA TAATGAGAAA AGAAGGAAAT ACATACAGAA TATATCCGTC AAAATTTTGT
GAGAAAACAC AGATTTATCG AATAGAATTT AACAATCTTC CCGACGAACT TAAAGATAAA
AAACCTTATT ATTTTGAAGT AGTTTACTAT AAACCAGTTT CTGTAAAAGT TCACAAGCAT
TCTAAAGTGA ATTTGAAGTA TGCTAAAATA ACTTCTATTT CCTTATCCCA AGACTCTGAG
CATCCTCAAA GGGGTTATCT TATTATTTCG GGTAACGTAG AAAAAAAGAA ACATATGCAC
TGGATTATAA ATGAACCAGA AGAACACAAC TATATTGTGA TTCCTGAAAA AAAGATTGAA
GAATACAGAA ATGATGAAAA AAGAGATCCT TCATTTGATA TTCTCAAGAT ACTTAATGAG
TGTGGTGAAG TTCCAGTATT TTTTATTACT GACCAAGCAA ATAATGTTAT AGCCTTTGGA
CATACAGGTT TTTTCCGCTT GTCTTATGAT TATACAATTG GAGAGCATAT TCCAAAAAAT
CTACAATCAG ATGATGTTAT TGATTTTGCT GAAGCTATTT TTGGGAAAGC AGGCCAAACA
AATTCTTTTG CTTCCAGGGT CTTTTTTGAA GATGCGGAAT TGATTGAAAC TCCAGAAAAT
TTGGAGAATA TATTTTTAAC AGAAACCTCA CCTAAAATTT TGAGTGCACC GAAACCTACA
GCTTTTCAGC ATTATTTAGA ACAACCAGAA GGTGTTCAGA CATCAAAAGA TAAACTGCAT
CATTGGAATA CGAAGGAAGC TAAAATAAGA GGATATAAAC TTTATTGGCA CAGGAACACA
CCTGATGAGC CATACCATGA GCATAGTTGG AGTGAAGGAA AAATTATAAA AGATTCAGAA
CAGCATACTG TTATCAAGCC CATAGGCAGA GGAGTGAAAT TTAAATCTCG CATCAGATTT
GAAAACTTAT CGAAAGAAGA GCTTGGTTGT TTACTATTTG TTTTGGATTT GCCAGATGGT
TACTATCACA AAATTGGAAT GGGGAAACCT CTTGGACTTG GAACTATAAA AATAAAGCCA
ACATTATTCC TCATAGATAA AAAAATAAGA TATAGCTCTC TCTTTCATGA AGATGAATGG
GAATTAGGGA TTGAAAGGAA AGAAACTTTG CAAGATTATA AAAGTGATTT TGAAAAATAT
ATTATGCGAA ACATTCCAGA TGAAGAAAAA GATAACGCAA ATTCATTGTG GGAAACTAAG
CGCTTGAAGG AGTTAAAAAT ATTGCTTTGT TGGGAACACA ACAATTGTGT AGGATGGTTA
GAAAAAACAA GATACATGAC CATAGGTGAC AGGGCGAAAA AAATCGAAAA CGAATTTAGG
AAACGAACTG TGCTGCCAAA ACCTTCTGAA GTTATTCAAG GAGATTAA
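
The record reports a GC content of 33% for this gene. One plausible way to derive such a figure from the sequence as printed (10-base blocks separated by spaces) is sketched below. The function name `gc_percent` is hypothetical, and the database may compute or round its value differently.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to print the sequence in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGGATAAAA GAAACACAGA"
print(round(gc_percent(first_blocks), 1))  # 30.0
```

Passing the full gene sequence as one string, with or without the spacing, gives the gene-wide percentage.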
 
Protein sequence
MDKRNTEAKP NQRMAAKAPY NFVPLNICVV KAQEVPSFSK FYKDRYSGYI ELEIETITPL 
FIGTKEKCSQ FFSPAGKPRI PGSTLRGMTR TLVEIVSYGK YGFCDKSRRL YYRAVAGSSS
LDKSYRELFV DRNDYFKYKF SAGIMRKEGN TYRIYPSKFC EKTQIYRIEF NNLPDELKDK
KPYYFEVVYY KPVSVKVHKH SKVNLKYAKI TSISLSQDSE HPQRGYLIIS GNVEKKKHMH
WIINEPEEHN YIVIPEKKIE EYRNDEKRDP SFDILKILNE CGEVPVFFIT DQANNVIAFG
HTGFFRLSYD YTIGEHIPKN LQSDDVIDFA EAIFGKAGQT NSFASRVFFE DAELIETPEN
LENIFLTETS PKILSAPKPT AFQHYLEQPE GVQTSKDKLH HWNTKEAKIR GYKLYWHRNT
PDEPYHEHSW SEGKIIKDSE QHTVIKPIGR GVKFKSRIRF ENLSKEELGC LLFVLDLPDG
YYHKIGMGKP LGLGTIKIKP TLFLIDKKIR YSSLFHEDEW ELGIERKETL QDYKSDFEKY
IMRNIPDEEK DNANSLWETK RLKELKILLC WEHNNCVGWL EKTRYMTIGD RAKKIENEFR
KRTVLPKPSE VIQGD
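
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and that translation stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, not part of any database API.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGATAAAA GAAACACAGA AGCAAAACCT"))  # MDKRNTEAKP
print(translate("GGAGATTAA"))                         # GD
```

Both fragments agree with the start (MDKRNTEAKP) and end (GD) of the protein sequence listed in the record.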