Gene Athe_0142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0142 
Symbol 
ID7408504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp178388 
End bp180409 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content41% 
IMG OID643714546 
Productband 7 protein 
Protein accessionYP_002572069 
Protein GI222528187 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCATAT TGCTTGAAGT TGCTGGTGTG TTAATTTCCA TTTATATTTT GCTTAAGCTC 
ATAGGTCTTC GCGTCATTCC AAATGACAAA GTGGGTATTG TCGAAAAATG GTGGTCTTTC
AAAGGTTCAC TGGATGAGCA GATTATAGCG CTGCACGGTG AAGCAGGCTT TCAGCCAGAG
GTTTTAAGAG GTGGTATTCA TTTTAGGACA CCGCTTATGT ACAAAGTTCA TATTGTTCCG
CTTGTTACAA TTCCACAGGG TCAAATTGGC TATGTGTTTG CAAGAGATGG GAAACCGCTC
GAGCCTACTC AGACGCTCGG CAGAGTTGTT CCAGAGTGCA ATAATTTCCA AGATGTACGT
GCATTTTTAG AAAATGGAGG GCAAAGAGGA CCTCAAAGGG CAATACTGAG AGAAGGAACA
TATGCTTTTA ACCTTGCTCA GTTTATAGTC ATCACAGAAG ATAAGATTTA CTATCTTCCA
ATGGGCAACA AAGAAGAAAA AGAGATGATA GAGAAGATGG TCCAGACTTT AAAATCACGA
AATGCATTCA GGCCGATTGT AATAACAGAG GACAAGGTTG GGATTGTAAC TGTACACGAC
GGACCATCAC TTCCAAGTGG TGATATTGTC GCACCGACAG TTGGCGATGA CCCATCTGAC
CCTGAGACTT ATCACAACAA CTTCCAGGAT CCTGAAAAAT TTTTAAAAGC TGGGGGATTT
CGTGGAAGAC AGCTTCAGGT GCTTGTTGAA GGGACATACT TTATAAACCG CCTGTTTGCA
ACAGTTGAGC TGATTGACAA AACGGTTATT GAAGTTGGAT ATGTCGGAGT TGTTGTATCA
TATGTTGGTC CAAAAGGGCA GGATACATCG GGTGAGGACT ACAAACATGG TGAGCTTGTT
GAAAAAGGAT ACAGAGGTGT TTGGAAAGAT CCTCTGATGC CTGGAAAATA CGCTTTTAAC
ACATATGCTG GGAAAATTGT AAAGGTTCCC ACTACAAACA TAATTCTAAA ATGGATAAGT
AACCAGACAG GTACGCACCG CTATGACGAA AACCTCAAAG AAGTAAGTTT AATCACAAAA
GATGCGTTTG AACCGTCGCT GCCATTAGCA GTTGTTCTTC ACATAGATTA CAGGAAAGCT
CCTTTGGTTG TCCAGAGATT TGGGGATTTG AAGATGCTTG TTGAGCAGAC TCTTGACCCG
ATGGTATCTG CATACTTTAA AAATATAGGG CAGAAAAAGA CCCTCATTGA GCTTATTCAG
CAAAGAGATG AGATTCAAAA GATAGCATCA GCTGAGATGA AAGAGAGATT TGCTCATTAC
AACTTAGAGC TTGAAGAGGT TTTAATTGGA ACACCAATGT CATCTCCAAA CGACAACAAG
ATAGACGCAA TCTTAGAGCA GCTTCGCGAC AGGCAAATTG CCCTTGAGCA GATAGAGACG
TACTCACGCC AGCAAAAAGC AGCTGAAAAA GAAAGAGAGC TCAGAGAGGC AGAAGCTCGG
GCTGCCCAGC AAAAACTTTT GACAGAGTCT GAGATAAATA TTCAGATTCA GACAAACCAA
GGAAAGGCAG AATATCAGAG GTCACTTCAA GAAGCTCAGA AGATAAAAGC ATTGGCTGAG
GCAGAAGCAG AAAAAGAAGC ACGAATTGGT ATTGGCCGGG CAATTGCAAT AGAAGAACAG
GTAAAGGCAT ACGGCGGACC GCAATATCAA GTTCTGCAGG ATGTCATGGG CAAGTTTACT
CAGGCTTTAG AGAAAACTGG TATTGATATA GTTCCTGAGA CAGTTGTTTC AATGGGTGAA
AAATCATCTT CAGTTTCATT TAATGCGTTT GAAATGCTAC TCACTTTGCT TTTGACAAAA
GAACTCGGTG TTGAATTCAA GGCAAAAGAG ACAGAGGATG AGAATATAAA GAGGATAAAG
CAAGAAATAC TAAATTCAAT CCTTCTTGCA AAAGAAAATG AAAAAGAAGA GAAAACTGAA
CAGGTTTCTC AGGGGTTACA GCAGCAAAAT GCAGTATCTT AA
 
Protein sequence
MGILLEVAGV LISIYILLKL IGLRVIPNDK VGIVEKWWSF KGSLDEQIIA LHGEAGFQPE 
VLRGGIHFRT PLMYKVHIVP LVTIPQGQIG YVFARDGKPL EPTQTLGRVV PECNNFQDVR
AFLENGGQRG PQRAILREGT YAFNLAQFIV ITEDKIYYLP MGNKEEKEMI EKMVQTLKSR
NAFRPIVITE DKVGIVTVHD GPSLPSGDIV APTVGDDPSD PETYHNNFQD PEKFLKAGGF
RGRQLQVLVE GTYFINRLFA TVELIDKTVI EVGYVGVVVS YVGPKGQDTS GEDYKHGELV
EKGYRGVWKD PLMPGKYAFN TYAGKIVKVP TTNIILKWIS NQTGTHRYDE NLKEVSLITK
DAFEPSLPLA VVLHIDYRKA PLVVQRFGDL KMLVEQTLDP MVSAYFKNIG QKKTLIELIQ
QRDEIQKIAS AEMKERFAHY NLELEEVLIG TPMSSPNDNK IDAILEQLRD RQIALEQIET
YSRQQKAAEK ERELREAEAR AAQQKLLTES EINIQIQTNQ GKAEYQRSLQ EAQKIKALAE
AEAEKEARIG IGRAIAIEEQ VKAYGGPQYQ VLQDVMGKFT QALEKTGIDI VPETVVSMGE
KSSSVSFNAF EMLLTLLLTK ELGVEFKAKE TEDENIKRIK QEILNSILLA KENEKEEKTE
QVSQGLQQQN AVS