Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1944 |
Symbol | |
ID | 7407358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2050960 |
End bp | 2054565 |
Gene Length | 3606 bp |
Protein Length | 1201 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716316 |
Product | Fibronectin type III domain protein |
Protein accession | YP_002573804 |
Protein GI | 222529922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCCA TCGCAATTAA AATAGTTTCG ATTTTCCTTT TAATATCTTT TTTGATTGCT TTGGTTCCCC AAAATCTAAT AGCACAGCCT GCAATTGTTC TTCCTGCACC TCAGGGTTTG AATATAGCAG TTGTAAATAA TCTTCCGAGG ATGGAAGTAG CTCAGGATGG AACTATAACA CTTTATTTTT CTTGGCAATA CAGTTATTCT GACTTTGATT ACTTTGAGCT GTATCTTGGT GATGACCCCA CAAGATTTCC TTATGGATAT ACCATATACA AAAATGACCC AAACCTCACA GTTTCAAATG GCAATTATAC TTATAAAGTT CAGAGTCTTC CGAATGGGCA AAAGATTCCA AGCGGAACAA TTTTTTATGC CAAGGTAAGA AGTGTGAGAG TTTTACAGGA ACAGACAGGA AATGTTGTTT ATTACTCTTC GTATTCAAAT ACAATTGTGT TTTTGACACC TATTTTTGTT GAATGTTATA CAAATGCTGA AGATGCCATT GATATAGTGT GGGATGATGT TTACTATTCT GGCAAAAGAA TAGATTATGA CATATATGTG TCAAAAGACA TAAGCTTTAC AACTCCAACG AAGTATCAGA TAGATGGTGA CAGGATAACC TTACTTCAGA GCCAAAAACC GGGTGGAAGA GTTGAAATTC TGCCAGGCAG GAAACTCAAA TATACAGCAG CAGGACTTTC GCCAAGTAGC CTTTACTATG TAAAAGTTTT GCCGAGAAAT TTGCCTCAAG AAGTTATCTG GCGTGACCCA CAGACATACA CACCACCAAA TATTAAGGTA ATCGGTGAGG CGGCAACATA CATTCAGGCG GAGGCTCAGA GAATTGCTGA CAATGTTGTG TGGTTAAGGT GGGCAAAGGT ATCAATTGCC GAAAACGAAT ATGAAATTTA CAAAGGTAGT AAAGATCAGA TACCTACTTT AATTGGTACA GTTTCGGCGA ACGAGTTTTT TGCTGTTGTT AGTATCACAG ATGATGTGTT TTTCAGAATA CAGGTTGATG TTTTTGATAG TTTTGGCAGA AAGGTGTCTA TCAGATCAAA AGACTTGTAT GTTCATCCAT ATACACTGCC TTTTGCACCA CCTGCGGCAG AAAATTTGAC TGCTTTTCCG AAGTCTCAAG ATACAATCTC TTTAAGATTT AAAATACCAA CAGATAAAGA TGTTGTGTAT GATTTTTATT ACAAAAAATA TTCGGATAGC AATGTTGATT TTACTCTTTT TGTATCCAAT TATCAGATGA AAAGTTCTGA TGAGGAAAAG GACGAGAACA ATCTTCCCAC AGGATATTAT AGGTTTGACA TCACAGGTCT TGAAAAAAAC ACTGTTTATG TTTTAAAGGT TGTGGTGAAG AAGAGGTTTT ATGATTATGA ACAGGGGACA TACATTTACA AGGAATCAAC CCCTGCTTTG ACAATTTCAT ATACTACTTC TGGCGACATA ACTCCGCCAA CCACTCCAAC ACTTTTGTCT GTTGTATATA CAACTTACGA TTCTGTAGTT TTGTCATGGC AGCCTGTAAC TATTGCAGGT ATCCAGCCGC CGACTGTAGA CAGGAGCATC TTCTATGAGG TCAACTTTGC AGTGTACCAG GTTGGAATGG ATATAACTAA TCCAGAAAAT CTTGATATAG CAAGTTTCCA AAAGATATCA CTTTCTGACT ATCAGGTAGA TCAATCAGGC AAGATAGTTT TCAGGGTAAG CAGTCTCTTG CCAAACACAA GGTATGTATT TTTTATAAGA GCTGTGAGAA AGATTGACAG TAGTGTATAC TATTCATTAC CATCCAATGT TGTAATGGCA ACAACGCTGA TAAAATATGA GGTGCCGCTA CCTTCTTCTG TTCCAGTTGT TGAAAATTTG AGCGTTGTGA CAACAACGTA TAATTCTGCT CTGCTTTCAT GGAGCTATAT AGAGAATGTA TATTTTGAAG TTCAGTTATC AGAAGACATT AAAAACTCAA ATGCATGGCA GATTGCATCT GACAGTTTTA AACCTTCTTT AAAAGAGATT GATTATACCA CCGGCCTTTG TTATTTCACA GTTCAAAACT TAAAACCTGA TACGCTCTAC TATTTTAGGG TTAGGGCATA TATCATCAAA GACAATCAGA AAGTGTATTC AGAGTTTAGC AGTCCTGTTT TTGGCAGAAC ACAAAAGGTT CCTCCACCGA AAACTCCAGT AGCTTTTGGT ATAAAAGACT ACGGGAAAGA CTACGCCATT TTTGTTTGGG AGATTGCCGA GACGGGAAGA AGATATGTTA TTGAGGTAGC TGACAATATT TCATTTTCAA ACTCCCAGAA GTACACAACC GGCCCAGATA CAACTGAATA TAAAGTAATC GGCTTGAAAC CCAATACAAG GTATTGGGCA AGACTTTTTG CTATAGCTTC TGATGGTCAG CTATCTCAGT CGACAGAGAT TATCTCTTTT GTTACTAAAA AGGATATAAG CGAGTATACA GGTGTGTTCG ATTCTGTTCA GGATACAACT CTGCCTTTTA TAACTGTAGA AGACCCTGCA AGTGGCAGGA TGATAATAGA GATTACCTAT AGATATGTCA ATGAGTCTTT GGACTCAAAG CCTGTTTTAA TTGATTTTAC AAAGAGAACA AGTTCATCTA TTTTTCAGTT TGTGATAAAA ATAAGATACG ATGTTTTAAA AGCGCTGGTT AAGCTCAATA AAGATTGCAT TGTCACATTA GATGGAGCAA CTTCGCAGTT CAATTTTGCT GCCATCGACA GTAGCGACAT AGATAAACTG ACAGTGTTTG GCGTATCACC ATCAAGTATT TATACTGAAC TGACATTTAT AAGAGCATCT GACAAATATA ATGTAAAAGA TGCTATCTCT GAAGTGTATG ATATCAGGTG CACAGCTTCA AGCTTTACAA AGCAAGTAGG AATATCATAT TTTCAAACGC CAGTTAAAAT TTCGCTAATA AACAGAGAAC CGTGGTCAGT CGCAATACCA TATGTTTTTG ATTTGACCTC TCTTTCTTGG AAAGAACCAG AAAACGCTGT GTTTGCAAGT GACAATAAAA GTGTAACTTT TAATTTGCAA ACACCTCAGG CAGTTGTGAT TGTAAGAAAA GGCTTTTACA AAGATATAAT TTCAAGCAGC TATGCAACAA AACTTTACAA TCTTTTTAAG ACTATTGCCA GCGATGATAC AAGCGATACA ATAGGAATAA AAAACGCTGT CTCAAAACAG GAACTTGCCT CTTTTTTGGT ATATTTTGCA GAGAAGAAGA GACTCTACAG GTTTGAGATA ATTGATGAGT ATATTAAAAA AGCGTACAAG GCAGGACTAA TTGAAAATAC TCAGGACAAT TCTGTTCTTA CAAAAGAAGC CGCTGTAGAT ATGATGGTAA AGTTTTATGA GATTTACACT GGCAATGAAA TCTCAGTAGA CGATGTTGCA TGGACAAAAC TTTCTGCTGA TGACAGATAC TTGTTGTCTT TGAAAAAAGC ATACAAGATG GGTTGGCTTT TTGATTATGT TACGTTCAAT CCCAAAGAGA CAGCAACAAG AGAATATGTT TTAGCATTCT TCTATCATGT TGTTTCGAAT ATATGA
|
Protein sequence | MKSIAIKIVS IFLLISFLIA LVPQNLIAQP AIVLPAPQGL NIAVVNNLPR MEVAQDGTIT LYFSWQYSYS DFDYFELYLG DDPTRFPYGY TIYKNDPNLT VSNGNYTYKV QSLPNGQKIP SGTIFYAKVR SVRVLQEQTG NVVYYSSYSN TIVFLTPIFV ECYTNAEDAI DIVWDDVYYS GKRIDYDIYV SKDISFTTPT KYQIDGDRIT LLQSQKPGGR VEILPGRKLK YTAAGLSPSS LYYVKVLPRN LPQEVIWRDP QTYTPPNIKV IGEAATYIQA EAQRIADNVV WLRWAKVSIA ENEYEIYKGS KDQIPTLIGT VSANEFFAVV SITDDVFFRI QVDVFDSFGR KVSIRSKDLY VHPYTLPFAP PAAENLTAFP KSQDTISLRF KIPTDKDVVY DFYYKKYSDS NVDFTLFVSN YQMKSSDEEK DENNLPTGYY RFDITGLEKN TVYVLKVVVK KRFYDYEQGT YIYKESTPAL TISYTTSGDI TPPTTPTLLS VVYTTYDSVV LSWQPVTIAG IQPPTVDRSI FYEVNFAVYQ VGMDITNPEN LDIASFQKIS LSDYQVDQSG KIVFRVSSLL PNTRYVFFIR AVRKIDSSVY YSLPSNVVMA TTLIKYEVPL PSSVPVVENL SVVTTTYNSA LLSWSYIENV YFEVQLSEDI KNSNAWQIAS DSFKPSLKEI DYTTGLCYFT VQNLKPDTLY YFRVRAYIIK DNQKVYSEFS SPVFGRTQKV PPPKTPVAFG IKDYGKDYAI FVWEIAETGR RYVIEVADNI SFSNSQKYTT GPDTTEYKVI GLKPNTRYWA RLFAIASDGQ LSQSTEIISF VTKKDISEYT GVFDSVQDTT LPFITVEDPA SGRMIIEITY RYVNESLDSK PVLIDFTKRT SSSIFQFVIK IRYDVLKALV KLNKDCIVTL DGATSQFNFA AIDSSDIDKL TVFGVSPSSI YTELTFIRAS DKYNVKDAIS EVYDIRCTAS SFTKQVGISY FQTPVKISLI NREPWSVAIP YVFDLTSLSW KEPENAVFAS DNKSVTFNLQ TPQAVVIVRK GFYKDIISSS YATKLYNLFK TIASDDTSDT IGIKNAVSKQ ELASFLVYFA EKKRLYRFEI IDEYIKKAYK AGLIENTQDN SVLTKEAAVD MMVKFYEIYT GNEISVDDVA WTKLSADDRY LLSLKKAYKM GWLFDYVTFN PKETATREYV LAFFYHVVSN I
|
| |