Gene Athe_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2019 
Symbol 
ID7408231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2129997 
End bp2132099 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content31% 
IMG OID643716386 
Productprotein of unknown function DUF87 
Protein accessionYP_002573870 
Protein GI222529988 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATGA GAAAAGTAAT AAATGAAATA AGTAAGGAAA GCGAAGTGGT AAAACAACTT 
CTTAACATTG TTCAAAATGC TAGGTTTATT GGTTATGCTA TTGATGTTTC TTACTCCTTC
ATGACTGTTC TTACAAATGA TGCATGGAAA GAAAGAGCAA ATGGTTTGCC ACACAATAGC
TTTTTATTTG CCGCTTCACC AAGATGGTTG ATTTATGACA AGGACACAAA TGATTTTAAC
ATTGATCCCA CCAAAGAAAT ACCAGAGATT ATATTACTCC GTGTGACAGA AGAATATGAA
TTACCGAATG AGGATGTTTG GTTAATGGCT AAAATTGATA AATTTAAGAA CGTAGGAACT
CGTGAATTAA AAGAGGATTT GAGTTTTGAT GATTTTTCTC GAAATGAAAT ACAATATGCT
GGCTTAAAAT GCCGAATTCT TGGTACATTC TATCCTTCTG AGAATAATTC ATTGGAATTT
GGCTCTGATT TGGAAAATTA CTATGGTGCA AAAATTTTAT TTGTTTTTAA ACCTTCAAAT
GAAGGTTTAG AAAGCATTAT AAATTTTGCT GTTATGAAAA AACAAAATGA AGTTCTCCAA
AATGCGTCTT TAGTTCCAAT TGGTTATGTT CGATATACTT CTACATGTCG TTTACAAAAT
CAAGAACCTT CTAAAGCACG AGTATACATC AATCTTGATG ACTTTATTAA AAGGAGAACA
GCATTATTCG GTATGACTCG AACGGGAAAA TCAAATACTG CTAAAATTTT AATTAAAGCT
ATTCGCGAAG CTGCTCAGAA GTCTGGACTA AAAGTATCCC AAATAATTTT TGATATTAAT
GGTGAATATA TATATCCTAA CAAACAAGAT GAAAATAAGT CAATTTCTGT CGAGATAGAA
AATTGTTTTG TTTTAACTCT AAATCCGAGA GCTCTAAGTA GTGAAAATCA AGAAATTCAA
CCGTTAAAGT TTGATATGTT AAAGAATTTA AGTCTTGCCC ATGAACTTAT TCGGGCATTA
GCAGAAAAAG AAGGAGCATT AAGTTATTCA ACAGACGCAC AAGCATTCCT TAACGTTGAT
ATTAGCGCAT ACGAATATGA TTTAAAAAAT GGACAGCCTG AAGAAAAGAA AAGGGCTAAA
AGAATACTTG AAGTTTACAA ATTAATTTTG GCAAAAAGTT TAGATGAACC AAATGTTGAA
TTTGATAAAA ATGTTTTTGG TCAGACAGTA TATTCCGAGA TGGAAAATAT TCTTAGTAGT
ACAGATGATA ACCAAGATGA AAAAGGTAGA ATAAAACAGG ACCTAAAAGA AAGGTTGGAA
CGTTTGCAAC GTCTAAGAGA TATAACAAGA AAGAGCAGTT TCACAATAGA TGAATTTGAC
TTTATTTTGG ACACAATCCA TTTCATTTGT ACCAAGTTAG GTAAAAACAT AAAAACCTCT
AGCGGAAACA ATCTTTACCG AGGTGATTTT GAAACACTTG TAAATTTCGC TGTAAGACGA
AATTCTTCAG GACAAACTAT TCTTGGTTAT TCTTTACTGC GAAAAATTCA AATAAAAGAT
TATCACCAGA AAGATAAAAG CAATTATATT CAAACGATAA TTGAAAAAGT TAGGAATGGA
GATGTTGTTC TAATAGATAT GGTTTACGGT AACGAACGAA TGAGAAAAAT TATAAGTTCT
AAGATTGCTT ACGAAATATT TAACTACAAT CAGCAAATTT TCACAAGAGC AGAAGAACCA
CCATACGTCA TTTTTTACAT TGAGGAAGCC CATAATTTAA TTGGCAAAGA CATGGATGTT
ACAGATATAT GGCCAAGAAT TGCAAAAGAA GGTGCTAAAT ATAACATAGG TCTTGTCTAT
TCAACACAAG AACCATCAAC TATAAACAAG AATATTCTTG CAAATACTGA AAACTGGTTT
GTTACACATT TAAATAATGA AGAGGAAATT AAAACTGTTG CCAAATATTA TGATTTTGCT
GACTTTAAAG AATCTATTTT GTTAGCGAAG GATGTGGGTT TTTGTAGAAT GAAAACTTTA
TCTTCTCCTT TTGTTTGTCC TGTTCAAATT TATAAATTTT CAGATTTTAC ATTAAATAGA
TAG
 
Protein sequence
MDMRKVINEI SKESEVVKQL LNIVQNARFI GYAIDVSYSF MTVLTNDAWK ERANGLPHNS 
FLFAASPRWL IYDKDTNDFN IDPTKEIPEI ILLRVTEEYE LPNEDVWLMA KIDKFKNVGT
RELKEDLSFD DFSRNEIQYA GLKCRILGTF YPSENNSLEF GSDLENYYGA KILFVFKPSN
EGLESIINFA VMKKQNEVLQ NASLVPIGYV RYTSTCRLQN QEPSKARVYI NLDDFIKRRT
ALFGMTRTGK SNTAKILIKA IREAAQKSGL KVSQIIFDIN GEYIYPNKQD ENKSISVEIE
NCFVLTLNPR ALSSENQEIQ PLKFDMLKNL SLAHELIRAL AEKEGALSYS TDAQAFLNVD
ISAYEYDLKN GQPEEKKRAK RILEVYKLIL AKSLDEPNVE FDKNVFGQTV YSEMENILSS
TDDNQDEKGR IKQDLKERLE RLQRLRDITR KSSFTIDEFD FILDTIHFIC TKLGKNIKTS
SGNNLYRGDF ETLVNFAVRR NSSGQTILGY SLLRKIQIKD YHQKDKSNYI QTIIEKVRNG
DVVLIDMVYG NERMRKIISS KIAYEIFNYN QQIFTRAEEP PYVIFYIEEA HNLIGKDMDV
TDIWPRIAKE GAKYNIGLVY STQEPSTINK NILANTENWF VTHLNNEEEI KTVAKYYDFA
DFKESILLAK DVGFCRMKTL SSPFVCPVQI YKFSDFTLNR