Gene Athe_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1944 
Symbol 
ID7407358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2050960 
End bp2054565 
Gene Length3606 bp 
Protein Length1201 aa 
Translation table11 
GC content37% 
IMG OID643716316 
ProductFibronectin type III domain protein 
Protein accessionYP_002573804 
Protein GI222529922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCCA TCGCAATTAA AATAGTTTCG ATTTTCCTTT TAATATCTTT TTTGATTGCT 
TTGGTTCCCC AAAATCTAAT AGCACAGCCT GCAATTGTTC TTCCTGCACC TCAGGGTTTG
AATATAGCAG TTGTAAATAA TCTTCCGAGG ATGGAAGTAG CTCAGGATGG AACTATAACA
CTTTATTTTT CTTGGCAATA CAGTTATTCT GACTTTGATT ACTTTGAGCT GTATCTTGGT
GATGACCCCA CAAGATTTCC TTATGGATAT ACCATATACA AAAATGACCC AAACCTCACA
GTTTCAAATG GCAATTATAC TTATAAAGTT CAGAGTCTTC CGAATGGGCA AAAGATTCCA
AGCGGAACAA TTTTTTATGC CAAGGTAAGA AGTGTGAGAG TTTTACAGGA ACAGACAGGA
AATGTTGTTT ATTACTCTTC GTATTCAAAT ACAATTGTGT TTTTGACACC TATTTTTGTT
GAATGTTATA CAAATGCTGA AGATGCCATT GATATAGTGT GGGATGATGT TTACTATTCT
GGCAAAAGAA TAGATTATGA CATATATGTG TCAAAAGACA TAAGCTTTAC AACTCCAACG
AAGTATCAGA TAGATGGTGA CAGGATAACC TTACTTCAGA GCCAAAAACC GGGTGGAAGA
GTTGAAATTC TGCCAGGCAG GAAACTCAAA TATACAGCAG CAGGACTTTC GCCAAGTAGC
CTTTACTATG TAAAAGTTTT GCCGAGAAAT TTGCCTCAAG AAGTTATCTG GCGTGACCCA
CAGACATACA CACCACCAAA TATTAAGGTA ATCGGTGAGG CGGCAACATA CATTCAGGCG
GAGGCTCAGA GAATTGCTGA CAATGTTGTG TGGTTAAGGT GGGCAAAGGT ATCAATTGCC
GAAAACGAAT ATGAAATTTA CAAAGGTAGT AAAGATCAGA TACCTACTTT AATTGGTACA
GTTTCGGCGA ACGAGTTTTT TGCTGTTGTT AGTATCACAG ATGATGTGTT TTTCAGAATA
CAGGTTGATG TTTTTGATAG TTTTGGCAGA AAGGTGTCTA TCAGATCAAA AGACTTGTAT
GTTCATCCAT ATACACTGCC TTTTGCACCA CCTGCGGCAG AAAATTTGAC TGCTTTTCCG
AAGTCTCAAG ATACAATCTC TTTAAGATTT AAAATACCAA CAGATAAAGA TGTTGTGTAT
GATTTTTATT ACAAAAAATA TTCGGATAGC AATGTTGATT TTACTCTTTT TGTATCCAAT
TATCAGATGA AAAGTTCTGA TGAGGAAAAG GACGAGAACA ATCTTCCCAC AGGATATTAT
AGGTTTGACA TCACAGGTCT TGAAAAAAAC ACTGTTTATG TTTTAAAGGT TGTGGTGAAG
AAGAGGTTTT ATGATTATGA ACAGGGGACA TACATTTACA AGGAATCAAC CCCTGCTTTG
ACAATTTCAT ATACTACTTC TGGCGACATA ACTCCGCCAA CCACTCCAAC ACTTTTGTCT
GTTGTATATA CAACTTACGA TTCTGTAGTT TTGTCATGGC AGCCTGTAAC TATTGCAGGT
ATCCAGCCGC CGACTGTAGA CAGGAGCATC TTCTATGAGG TCAACTTTGC AGTGTACCAG
GTTGGAATGG ATATAACTAA TCCAGAAAAT CTTGATATAG CAAGTTTCCA AAAGATATCA
CTTTCTGACT ATCAGGTAGA TCAATCAGGC AAGATAGTTT TCAGGGTAAG CAGTCTCTTG
CCAAACACAA GGTATGTATT TTTTATAAGA GCTGTGAGAA AGATTGACAG TAGTGTATAC
TATTCATTAC CATCCAATGT TGTAATGGCA ACAACGCTGA TAAAATATGA GGTGCCGCTA
CCTTCTTCTG TTCCAGTTGT TGAAAATTTG AGCGTTGTGA CAACAACGTA TAATTCTGCT
CTGCTTTCAT GGAGCTATAT AGAGAATGTA TATTTTGAAG TTCAGTTATC AGAAGACATT
AAAAACTCAA ATGCATGGCA GATTGCATCT GACAGTTTTA AACCTTCTTT AAAAGAGATT
GATTATACCA CCGGCCTTTG TTATTTCACA GTTCAAAACT TAAAACCTGA TACGCTCTAC
TATTTTAGGG TTAGGGCATA TATCATCAAA GACAATCAGA AAGTGTATTC AGAGTTTAGC
AGTCCTGTTT TTGGCAGAAC ACAAAAGGTT CCTCCACCGA AAACTCCAGT AGCTTTTGGT
ATAAAAGACT ACGGGAAAGA CTACGCCATT TTTGTTTGGG AGATTGCCGA GACGGGAAGA
AGATATGTTA TTGAGGTAGC TGACAATATT TCATTTTCAA ACTCCCAGAA GTACACAACC
GGCCCAGATA CAACTGAATA TAAAGTAATC GGCTTGAAAC CCAATACAAG GTATTGGGCA
AGACTTTTTG CTATAGCTTC TGATGGTCAG CTATCTCAGT CGACAGAGAT TATCTCTTTT
GTTACTAAAA AGGATATAAG CGAGTATACA GGTGTGTTCG ATTCTGTTCA GGATACAACT
CTGCCTTTTA TAACTGTAGA AGACCCTGCA AGTGGCAGGA TGATAATAGA GATTACCTAT
AGATATGTCA ATGAGTCTTT GGACTCAAAG CCTGTTTTAA TTGATTTTAC AAAGAGAACA
AGTTCATCTA TTTTTCAGTT TGTGATAAAA ATAAGATACG ATGTTTTAAA AGCGCTGGTT
AAGCTCAATA AAGATTGCAT TGTCACATTA GATGGAGCAA CTTCGCAGTT CAATTTTGCT
GCCATCGACA GTAGCGACAT AGATAAACTG ACAGTGTTTG GCGTATCACC ATCAAGTATT
TATACTGAAC TGACATTTAT AAGAGCATCT GACAAATATA ATGTAAAAGA TGCTATCTCT
GAAGTGTATG ATATCAGGTG CACAGCTTCA AGCTTTACAA AGCAAGTAGG AATATCATAT
TTTCAAACGC CAGTTAAAAT TTCGCTAATA AACAGAGAAC CGTGGTCAGT CGCAATACCA
TATGTTTTTG ATTTGACCTC TCTTTCTTGG AAAGAACCAG AAAACGCTGT GTTTGCAAGT
GACAATAAAA GTGTAACTTT TAATTTGCAA ACACCTCAGG CAGTTGTGAT TGTAAGAAAA
GGCTTTTACA AAGATATAAT TTCAAGCAGC TATGCAACAA AACTTTACAA TCTTTTTAAG
ACTATTGCCA GCGATGATAC AAGCGATACA ATAGGAATAA AAAACGCTGT CTCAAAACAG
GAACTTGCCT CTTTTTTGGT ATATTTTGCA GAGAAGAAGA GACTCTACAG GTTTGAGATA
ATTGATGAGT ATATTAAAAA AGCGTACAAG GCAGGACTAA TTGAAAATAC TCAGGACAAT
TCTGTTCTTA CAAAAGAAGC CGCTGTAGAT ATGATGGTAA AGTTTTATGA GATTTACACT
GGCAATGAAA TCTCAGTAGA CGATGTTGCA TGGACAAAAC TTTCTGCTGA TGACAGATAC
TTGTTGTCTT TGAAAAAAGC ATACAAGATG GGTTGGCTTT TTGATTATGT TACGTTCAAT
CCCAAAGAGA CAGCAACAAG AGAATATGTT TTAGCATTCT TCTATCATGT TGTTTCGAAT
ATATGA
 
Protein sequence
MKSIAIKIVS IFLLISFLIA LVPQNLIAQP AIVLPAPQGL NIAVVNNLPR MEVAQDGTIT 
LYFSWQYSYS DFDYFELYLG DDPTRFPYGY TIYKNDPNLT VSNGNYTYKV QSLPNGQKIP
SGTIFYAKVR SVRVLQEQTG NVVYYSSYSN TIVFLTPIFV ECYTNAEDAI DIVWDDVYYS
GKRIDYDIYV SKDISFTTPT KYQIDGDRIT LLQSQKPGGR VEILPGRKLK YTAAGLSPSS
LYYVKVLPRN LPQEVIWRDP QTYTPPNIKV IGEAATYIQA EAQRIADNVV WLRWAKVSIA
ENEYEIYKGS KDQIPTLIGT VSANEFFAVV SITDDVFFRI QVDVFDSFGR KVSIRSKDLY
VHPYTLPFAP PAAENLTAFP KSQDTISLRF KIPTDKDVVY DFYYKKYSDS NVDFTLFVSN
YQMKSSDEEK DENNLPTGYY RFDITGLEKN TVYVLKVVVK KRFYDYEQGT YIYKESTPAL
TISYTTSGDI TPPTTPTLLS VVYTTYDSVV LSWQPVTIAG IQPPTVDRSI FYEVNFAVYQ
VGMDITNPEN LDIASFQKIS LSDYQVDQSG KIVFRVSSLL PNTRYVFFIR AVRKIDSSVY
YSLPSNVVMA TTLIKYEVPL PSSVPVVENL SVVTTTYNSA LLSWSYIENV YFEVQLSEDI
KNSNAWQIAS DSFKPSLKEI DYTTGLCYFT VQNLKPDTLY YFRVRAYIIK DNQKVYSEFS
SPVFGRTQKV PPPKTPVAFG IKDYGKDYAI FVWEIAETGR RYVIEVADNI SFSNSQKYTT
GPDTTEYKVI GLKPNTRYWA RLFAIASDGQ LSQSTEIISF VTKKDISEYT GVFDSVQDTT
LPFITVEDPA SGRMIIEITY RYVNESLDSK PVLIDFTKRT SSSIFQFVIK IRYDVLKALV
KLNKDCIVTL DGATSQFNFA AIDSSDIDKL TVFGVSPSSI YTELTFIRAS DKYNVKDAIS
EVYDIRCTAS SFTKQVGISY FQTPVKISLI NREPWSVAIP YVFDLTSLSW KEPENAVFAS
DNKSVTFNLQ TPQAVVIVRK GFYKDIISSS YATKLYNLFK TIASDDTSDT IGIKNAVSKQ
ELASFLVYFA EKKRLYRFEI IDEYIKKAYK AGLIENTQDN SVLTKEAAVD MMVKFYEIYT
GNEISVDDVA WTKLSADDRY LLSLKKAYKM GWLFDYVTFN PKETATREYV LAFFYHVVSN
I