Gene Athe_1603 details


Gene Information

Locus tag: Athe_1603
Symbol:
ID: 7409433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1702074
End bp: 1703999
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 30%
IMG OID: 643715972
Product: UvrD/REP helicase
Protein accession: YP_002573470
Protein GI: 222529588
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID:
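
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the page does not state this, but it is the only reading that reproduces the listed gene length) and that the protein length excludes the stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1702074
END_BP = 1703999


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1926
print(protein_length(length_bp))  # 641
```

Both results agree with the Gene Length and Protein Length fields listed above.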


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGATAC CAAAGGAACA ATGGAAACCT GCTGATGGAA TTGAACTTGA AAAAAATGCA 
GAGATAGTAG TTAAAAGTAA TTGCAACTAT TTAGTAATAG CTGGACCTGG CTCAGGTAAA
ACAGAGTTGC TTGCTCAAAG GGCTTGTTTT TTACTTCAAA CGGGTATTTG CTATTCACCA
AAAAGGATAC TTGCAATCAG CTATAAAAAA GAAGCTGCCA AAAATTTAGC AGAAAGGGTT
CAAAAAAGAT GTGGTCAGAA ATTGGCGAAT AGATTTGTAT CACTAACTTT TGATGCTTTT
GCTAAGAGCC TTTTGGATAG ATTTCTGTAT GGACTTCCTG AAGTATACAG ACCTGATGTT
AATTATGATA TTAATCCTTC TGTATTAAGA GAGGGGAAGC GAAGCATTTT GTTAAGATTT
GAGGGAAAAG AACCGAAACA TTTACTTACA GGTGATTATG TTGATGGAAT TCATAGTCTT
TCAATAGAAA ATATTATTAA TAAAAAAATG ACAGATAAAA AGCTTCCCTT TAATAATGAA
CCTAACAATT TAGAAGAATG GATGATTAAC GAATTATGGA AATTATTTTT GAAAGGAGAT
AAAGATTTTA AACCTGCTCT GACTTTTCCA ATGATTTCAA GACTTGCTGA ATATATCTTG
AGGGAAAATC CTTTTGTTCT TAAAGCATTG CGAGCGACAT ATTCACATGT ATTTTTAGAT
GAATTTCAGG ATATAACAGG AGTACAATAT GATTTATTGA AAACTTGTTT TCTCGGGTCT
AATGCCGTAA TAACTGCAGT AGGAGATAAT AAACAGCGAA TTATGGAATG GGCAGGAGCA
CTAAGAAATT CATTTGAATT GTTTGAAAAA GATTTCAATG CCCAAAGAAT AAATTTACTG
ATGAATCATA GGTCTGCGCC GCGTTTAGTT GAATTGCAAA AAGTTTTTAT AAAAGAAGCA
TTTGGTGAAA AAATAGATTT GTCCGATTTA AAAATAGAGA ATAGCAAAAA ATGGAGTAAA
GATGATGGAA TATGTGAAGT TTGGATTTTT AATGATTATA TTAAAGAAGC TCAAATTTTA
TCTGAAAACA TAAAGAAGTG GCTAAATTCA GAAGATTTAA GTCATAGAGA TATTTGTGTT
TTGGTAAAAC AACGTCCTTC TAAATACACA AATATGCTGA TTAAAACATT AGCTGAAAAT
AGCATGAATG CAAGAGATGA AGGCGAAGTT TCAGATTTAT TAAATGAGGA AATAATTCAG
TTTATAATTA ATTTATTTTC ATTAGCATTA AATATTAATG GAAATAATTC ATACACATTT
GTATACGATT TTATAAAAAT ATTAAATGGA TATGATGATG ACACTCCAGA GAGGTTTTTA
ATTAAACTTG AATCTGAAAT AACAGAGTTA TTGAGGTTTG CAAACGAAAA ACTACTTAAG
ATTGATAATC AGAAACAAAT TGTAGAATTA TATAAAATTA TAATTAGATA TTTAAAAATG
GATAGATTAA TATCCTATTT TCCGCAGTAT AGAAATACAG CATGGTTTAA TGATTTAATG
AATAAGTCTA TTAAATTATT GTGGAATGAA TTTGAAATAA CAAATAATTG GTCAAAAGCA
ATAACTAACT TTATAGGAAT AAACAGTATT CCTATTATGA CTATTCATAA AAGTAAAGGG
CTCGAATATG ATACTGTTGT ATTCATTGGT TTAGAAGATT CAGCATTTTG GAGTATTAAA
AATCAACCTG AACAAGATAC TTGCGCCTTT TTTGTAGCTT TATCGAGAGC GAAAAGACGT
GTAATAATAA CTTTTAGTAA ATATAGAGAT GTCGGAGCAA ATCCTTGCCA AACTGCAGAA
AATGTTAAAA AATTTTATGA ATTACTTGCC AAATCGGGTA TTGTAGAATA TAGAGATTTT
ATGTAA
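
The gene sequence is printed in blocks of 10 bases, 60 per row, so the spaces and line breaks have to be stripped before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical, and only the first row of the sequence is used here. Reproducing the reported GC content would require passing the full sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, copied verbatim.
first_row = "ATGCTGATAC CAAAGGAACA ATGGAAACCT GCTGATGGAA TTGAACTTGA AAAAAATGCA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Applied to the whole sequence, len(clean_sequence(...)) should match the Gene Length field.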
 
Protein sequence
MLIPKEQWKP ADGIELEKNA EIVVKSNCNY LVIAGPGSGK TELLAQRACF LLQTGICYSP 
KRILAISYKK EAAKNLAERV QKRCGQKLAN RFVSLTFDAF AKSLLDRFLY GLPEVYRPDV
NYDINPSVLR EGKRSILLRF EGKEPKHLLT GDYVDGIHSL SIENIINKKM TDKKLPFNNE
PNNLEEWMIN ELWKLFLKGD KDFKPALTFP MISRLAEYIL RENPFVLKAL RATYSHVFLD
EFQDITGVQY DLLKTCFLGS NAVITAVGDN KQRIMEWAGA LRNSFELFEK DFNAQRINLL
MNHRSAPRLV ELQKVFIKEA FGEKIDLSDL KIENSKKWSK DDGICEVWIF NDYIKEAQIL
SENIKKWLNS EDLSHRDICV LVKQRPSKYT NMLIKTLAEN SMNARDEGEV SDLLNEEIIQ
FIINLFSLAL NINGNNSYTF VYDFIKILNG YDDDTPERFL IKLESEITEL LRFANEKLLK
IDNQKQIVEL YKIIIRYLKM DRLISYFPQY RNTAWFNDLM NKSIKLLWNE FEITNNWSKA
ITNFIGINSI PIMTIHKSKG LEYDTVVFIG LEDSAFWSIK NQPEQDTCAF FVALSRAKRR
VIITFSKYRD VGANPCQTAE NVKKFYELLA KSGIVEYRDF M
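
The protein sequence corresponds to the gene sequence translated under translation table 11. As a rough consistency check, the sketch below translates the first 30 bases of the gene sequence. It builds the codon table from the standard amino-acid assignment string, which table 11 shares for all codons (the two tables differ in permitted start codons, not in codon meanings). The names translate and CODON_TABLE are hypothetical; a library such as Biopython could be used instead.

```python
from itertools import product

# Codons enumerated in T, C, A, G order for each position, paired with the
# standard amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon.

    Any trailing partial codon is ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, with the block spacing removed.
print(translate("ATGCTGATACCAAAGGAACAATGGAAACCT"))  # MLIPKEQWKP
```

The output matches the first ten residues of the protein sequence above.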