Gene Athe_1602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1602 
Symbol 
ID7409432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1698593 
End bp1701835 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content37% 
IMG OID643715971 
Producthypothetical protein 
Protein accessionYP_002573469 
Protein GI222529587 
COG category[R] General function prediction only 
COG ID[COG1483] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTC TTTTCGAGCT TTGTAAACCC AGGGATGATG TGTTCACAAC AAGAAGCTCA 
GAAGATGTTC AGGAAATCTC AGACCTTCTT TCTGACAAAA TCAAGCCGGA AAAGTTTTTT
GAACAAACAT ACATAACAAA CGGTATGAGA AGATTATTTG ATGTTGCATT CAAAAGATTT
AGTGGAATTG ATGATGAACA AGGTGTAATT GTTCTGCGAC AGGCAATGGG TGGTGGCAAG
ACCCATAACA TGATTGCTTT GGGGCTTTTG GCAAAGTATC CGGAAATCAG AAAAAAGATT
TTAGGTGATG ATTATCCATA TATGTACACA GGTAGAATAA GACTTGCAGT ATTTACTGGA
AGATGGACAA ATTCAATCCC ATGGGTAGAA ATAGCGAACC AGCTTGGCAA GCAAGAGGTT
TTGAACAAGT CCCTTCAAAA TCATATGTTT GCACCTGGCG ATAGAGAGTG GACAGAGCTA
TTAGCAGGGG ACCCGCTTTT GATACTGCTT GATGAGCTTC CGCTATATCT TGAGTATGTC
CAGAGTATTC CAGTTGGTGA GAGTAACTTT GGAAAGGTTG TTTCAATTGG GCTTTCGAAT
CTTTTCAATG CAATCCAGAA AAAAGAGCTT TCAAACGTAA TGGTGGTTGT GGCAGACCTT
CAAGCAGCGT ATGAGCAAGG AGGGAAACTT TTACAAAGTG CTTTAGCAAC AGAGCTTGAC
AAAGAAGCAA CCCGTGTTGC CTCTTCTATC CAGCCGGTTG ATATGGTCAC AAATGATGTA
TACTGTATTT TGAGAAAAAG ATTTTTCAAA GAGTATCCTC AAAACCCTGA AAATGACCCA
GATGTAGAAG AGATTGCAAA AGCCTACAAA GAGGTTATTG AAGAAGGTAA TAGAAAAGGA
CAGACTACAG TTGAGCCAGA GCGAATTTAC ACTGAGATTA AGAGAACATA TCCGTTTCAT
CCTGCAATAA TGGAGCTTGT TAGCAGATTC AGGGAAAATG TCAATTTCCA GCAAACACGA
GGACTTATTC GTCTTATGAG GCACTACATA AGATATCTTT ACAGTGAAGA AGGGAAACTT
GCAAAGCAGA AGTATTTAAT TGAACCATAC GACTTTGACC TGAGCGATTA CAACACGCGC
GAAGAGATTA TAAATATAAA GCAGAGTTTA GAAAATGCCA TTATGCACGA TGTTTATTCA
ACAGCTCATA TTACAACTGC AAAGGCTATT GACCAAAAGT ACAATACTAA CTTAGCAAGT
GAAGTTGCAA AATTGATTTT GCTGTCTTCG CTGTCTGAAA TTACAAAATC TCCATTAGGT
CTTACAACAA GTGAGCTTTT TGCATATATG AGCTCTCCAA ACAAAGATTT AAGCAGTCTT
GCGAACATAA TTGAAGAATT TAAGCAAAAT TCATTACATA CAAGAATTGA TGCAAATGAC
AGAATATATA TTACACCAAT GGAAAATGTA GTTGCGCGCA TGAGAAAATT CAAAAGTATG
TATACATTAG ATGAAGCGGA GGCAGTGTTA GAGAGGCTTC TTTATGAATA TTTTTCACCA
AGGAATCTAA ACTGCTATCA GAAGGTGTAC CAAAAGATTG TCGCTTTGCC AAATGACATA
TCAAAAATCA ATGTGGATAT GAACAATGTA ACGCTAATTA TTTCAAAGCC TTATGACTCA
AAAGGGTCTC TTAACCCAGC ACTGATTGGT TTTTGGCAAA ATCAAACTTA CAAAAATAGA
CTTCTCTTTT TAACAGGTTC TACTACCATG TATAATACGC TTTTAGAGAA AATAAGAGAG
TATATGTGTT GGGAAAACGT GCTAAATGAG CTCAGAAAAG AGGGTGTTCC AGAGACTGAC
ACAGAATATC AACGAGCAGA AAATGAACAG AGCAAGATGA AAACTGCAGT TTTAGAAGTC
TTGAGAGAGA CTTTTAACAA GATTTACTAT CCACAGAGGC CATATACTAA ACAGAACGCT
GAACTTATTT CTGAAGATTT AAAATATCTG CCAGCGAGTG AAAATAGTGA TTCTCAAGGT
TCAAAAGGGA TAGACAGAGT TAGAGCTCTT CTTGGCAACA ACCGAGATGG AGAGATTGTC
ATTCAAAAAG TGTTGCTTAA CAAAAAGTAT ATAGAAGTAA ATGACATTGA TGAATTTCGC
GAAAAGGTAG AGAATTTGCT CTTCACAAGA GAGTCAATGT CTTGGTCCGA GATTGTTGAA
AGAGCAGCGC GAGATGCAAG GTGGTTTTGG CATCCACCAG AGGCCCTGGA GAATTTAAAG
CGCGAGATGC TTTCAAAAGG TTTGTGGAAA GAAAGTGGAG GGATTGTTTT AAGAGGTGAG
GCAGCAAAGG ACAAAACAAC AGTAAAAGTT CAGTTTGTAT CCCCTGACTT TGACACAGGC
GAGGTAGTTT TAAAGCTCAT ACCAATGTTT GGTGATGTTG TACATTATGA AGAGGGTGAT
AAGGAGGTAT CAGAGAACTC ACCGGTTGTG CCAAATATTA ATGAGTTCAG GACAAAAGCA
GTAAAGCTCA GTTTCCTTTG CATTGATTCA ACAGGGTATC ATCCAAAAGG TGATGTTGTA
AAATATGAAT GTGAAATTTT ACTTGATGAC AATATAAGAT TTTGCGACCA AGAAGGCAGA
GCTCACATAA AAATAAGAAC AGCTCCGCAG GCAACTGTAA AGTATACAAC AGATGGTTCA
AACCCGCGCT ATAATGGCAA AGTTGCCGAA AATGGTGATA TTACAATTCC AGATAATACA
AGGGTAATTA ATGTGGTTGC AGAAAAAGAT GGTGTTTTTT CAGAGATAAA AAATATTCCT
GTGAGAAGAG ATGTTACAGG AAACATAGTA ATTAAGCAAA TAGAACTTGA TTATACAAAA
CCGGTTAAAT TAAAACTTTC AGGTAGCAAG AAAATTCTGA TTTTTAGTAT AGAAGAACTT
CAAAGAGAAA TTGAACTATT GAAAAATTAC AAAGGAAAGT TTGTTGCCTA TTCTCTTGAT
ATTTATAGAG ATGATGATAA CTATATTGTC CTGACATGTA AAAAGCCACA AGGGGTAGAA
CTAAGCGAGA TTTTTACACA TATTGAAAAG GTCAAAACCG ACTTCTACAA TGATGGGATT
CAAAACGTGA GAGCTGTTAT TACAGAACTG TTATTTGAAG ATGCAAAAGA CTTTGAAGAA
TGGATAAAGC AAAAAGGCAA GACGCTTCAT GACTTCAAGG AGGCAATAAC ACAGAATGAG
TAA
 
Protein sequence
MKTLFELCKP RDDVFTTRSS EDVQEISDLL SDKIKPEKFF EQTYITNGMR RLFDVAFKRF 
SGIDDEQGVI VLRQAMGGGK THNMIALGLL AKYPEIRKKI LGDDYPYMYT GRIRLAVFTG
RWTNSIPWVE IANQLGKQEV LNKSLQNHMF APGDREWTEL LAGDPLLILL DELPLYLEYV
QSIPVGESNF GKVVSIGLSN LFNAIQKKEL SNVMVVVADL QAAYEQGGKL LQSALATELD
KEATRVASSI QPVDMVTNDV YCILRKRFFK EYPQNPENDP DVEEIAKAYK EVIEEGNRKG
QTTVEPERIY TEIKRTYPFH PAIMELVSRF RENVNFQQTR GLIRLMRHYI RYLYSEEGKL
AKQKYLIEPY DFDLSDYNTR EEIINIKQSL ENAIMHDVYS TAHITTAKAI DQKYNTNLAS
EVAKLILLSS LSEITKSPLG LTTSELFAYM SSPNKDLSSL ANIIEEFKQN SLHTRIDAND
RIYITPMENV VARMRKFKSM YTLDEAEAVL ERLLYEYFSP RNLNCYQKVY QKIVALPNDI
SKINVDMNNV TLIISKPYDS KGSLNPALIG FWQNQTYKNR LLFLTGSTTM YNTLLEKIRE
YMCWENVLNE LRKEGVPETD TEYQRAENEQ SKMKTAVLEV LRETFNKIYY PQRPYTKQNA
ELISEDLKYL PASENSDSQG SKGIDRVRAL LGNNRDGEIV IQKVLLNKKY IEVNDIDEFR
EKVENLLFTR ESMSWSEIVE RAARDARWFW HPPEALENLK REMLSKGLWK ESGGIVLRGE
AAKDKTTVKV QFVSPDFDTG EVVLKLIPMF GDVVHYEEGD KEVSENSPVV PNINEFRTKA
VKLSFLCIDS TGYHPKGDVV KYECEILLDD NIRFCDQEGR AHIKIRTAPQ ATVKYTTDGS
NPRYNGKVAE NGDITIPDNT RVINVVAEKD GVFSEIKNIP VRRDVTGNIV IKQIELDYTK
PVKLKLSGSK KILIFSIEEL QREIELLKNY KGKFVAYSLD IYRDDDNYIV LTCKKPQGVE
LSEIFTHIEK VKTDFYNDGI QNVRAVITEL LFEDAKDFEE WIKQKGKTLH DFKEAITQNE