Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_0729 |
Symbol | |
ID | 5162196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 814685 |
End bp | 817360 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640552645 |
Product | DNA topoisomerase I |
Protein accession | YP_001233869 |
Protein GI | 148259742 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.074453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCA TCGTCGTCGT TGAATCCCCC GCCAAGGCCA AGACGATCAA CAAGTATCTC GGCGACGGAT TCACCGTTCT CGCCTCCTTC GGCCATGTCC GCGACCTGCC GCCGAAGGAC GGCTCGGTCC GCCCGGACGA GGATTTCGCG ATGTCCTGGG AGGAGGACGC CCGCGGCGAC CGCCAGGTCG CGGCCATCGC CAAGGCGCTG AAAGGGGCGG ACACGCTCTA CCTCGCCACC GACCCCGATC GCGAGGGCGA GGCGATCTCC TGGCATGTCC GCGCCATGCT CGAGGCGCGC AACGTGCTCA AGGGCAAGCA GGTCCGCCGC ATCACCTTCA ACGAGATCAC CAAATCGGCG GTGCAGGCCG CCCTCCGCGC CCCGCGCGAC CTCGACACCC CGCTGATCGA GGCCTATCTC GCCCGCCGCG CGCTCGATTA TCTGGTCGGC TTCACCCTCT CGCCGGTGCT CTGGCGCAAG CTGCCCGGCA GCAAGAGCGC CGGACGCGTG CAGTCGGTCG CGCTGCGGCT GATCTGCGAG CGCGAGGCCG AGATCGAGCT GTTCCGCCCG CGCGAATACT GGACGATCGA GGCCGAGTTC CGCACCCCCG CCGGCGCGCC GTTCCGCGCC CGCCTCACCC ATCTCGACGG CCGCAAGCTC GACCAGTTCG ACCTCGCGGA CAAGGCGGCG GCCGATGCGG CGAAGGCGGC GGTCGAGCGT GGTGCCTTCA GCGTCGTCTC GATCGAGAAG AAGCGCGTCC GCCGCAACCC GCCGCCGCCC TTCACCACCT CGACCCTGCA GCAGGACGCC TCGCGCAAGC TGCACATGAC GGCGCAGGCC ATCATGCGCA CCGCCCAGCA GCTCTACGAG GGTGTGGAGA TCGGCGGCGA GACGGTCGGC CTGATCACCT ATATGCGAAC CGACGGCGTG CAGATGGCGC GCGAGGCCAT CACCGCCGCC CGCCAGCAGG TGAAGGACGC GTTCGGCGCG AACTACCTGC CCGCCGCCCC GCGCGAATAC CAGTCGAAGG CGAAGAACGC GCAGGAGGCC CACGAGGCGA TCCGCCCGAC CGATTTCACC CTCACCCCCG AGCGCGCCGC CCGCCACCTC TCGCCCGAGC AGGCGCGGCT CTACGAGCTG ATCTACAAGC GCGCCCTCGC CTCGCAGATG CAGTCGGCCG AGCTTGACCA GACCACGGTC GAGCTTGCCG ATGCCGCCGG CACCACCCTG CGCGCCACCG GCTCGATCCT CGCCTTCGAC GGCTTCCTCA AGCTCTACCG CGAGGGGATG GACGAGGACC CGGAGGACGA AACCGCGAGG CTGCTGCCGC CGATGGCGAA ATCCGACCCG CTCGCCAGCG GGCCGGTCAC CGCCGAGCAG CATTTCACCC AGCCGCCGCC GCGCTATTCC GAGGCCACCC TGGTCAAGCG CATGGAAGAG CTCGGCATCG GCCGCCCCTC GACCTACGCC TCGATTCTCA CCGTGCTGCG CGAGCGCAAC TATGTGCGGA TGGAAAACCG CCGCTTCATC CCCGAGGATC GCGGCCGGCT GGTCACCGCC TTCCTGGTCA GCTTCTTCGG CCGCTATGTC GAGACCGGCT TCACCGCCGA TCTCGAGGAG AAGCTCGACG AGGTCTCCGA CGGACGGCTC GACTGGCGGT CGGTGATGCG GGATTTCTGG GCAGATTTCT CTGGCGCCGT CGAGCAGATC AAGGATCTCA AGATCTCCGA CGTGATCGAT GCGCTGGACG AGGAGCTCGG CCCGCATTTC TTCCCCGCCC GCGAGGACGG GTCCGATCCA CGCGTCTGCC AGGCCTGCGG CACCGGCCGG CTCGGCCTGC GGCTCGGCCG CTTCGGCGCC TTCATCGGCT GCTCGAACTA TCCCGAATGC CAGTATACCC GCCGCCTCGC CGTCGAGGGC GGCGAGGACG AGGGCGACCA GCTGCGCGAC GGCATGAAGC TGCTCGGCGA GAACGCCGAG GGCATCCCCG TCACGCTGCG GCGCGGCCCC TATGGCCTCT ATGTCCAGCT CGGCGAGCCG GACCCGGAGG ACAAGAAGGC GAAGCCCCGC CGCGCCGCCC TGCCGCGGGG CATGAGCGGC GAGACGATCA CGCTGGAGCA GGCGATCGGC CTGCTCTCGC TGCCGCGCGT GATCGGCGTG CATCCGGAAA CGAAACAGGA AATCCAGGCC GGCATCGGCC GGTTCGGCCC CTATGTGAAG ATGGGCCCCA TCTTCGCCTC GCTCGACAAG GATGACGACG TGCTCGCCGT CGGCCTCAAC CGCGCGGTGA TGGTGCTGGC GAAGAAGCAG GAAGGCATCC GCGATCTCGG CCCGCACCCG AAGGACGGCG AGAGCGTGAC CGCGCGCAAG GGCCGGTTCG GCCCCTATGT CCAGCACGGC AAGACGGTGG CGACCCTGCC GCGCGGCAGC GAACTCGGCG CGGTGACCCT TGATGAGGCT GTCGCCCTGC TCGCCGAGAA GGGCAAGACG CTGGCGCCGC GCGGCCGCAA GGGCGCAAAA CCCGCAGCAA AGCCGAAGGA CGCGGCGGCG CCGAAAGCCG CCAGACCCGC GAAGGCGAAA GCTGCCGCCA AGCCGAAAGC CGCCGCCGCG AAGCCGGAGG CGGCCGCCGC CAAGGCAAAA CCCGCCGCCC GGAAGACCGC CCGCTCCGGC GGCTGA
|
Protein sequence | MNAIVVVESP AKAKTINKYL GDGFTVLASF GHVRDLPPKD GSVRPDEDFA MSWEEDARGD RQVAAIAKAL KGADTLYLAT DPDREGEAIS WHVRAMLEAR NVLKGKQVRR ITFNEITKSA VQAALRAPRD LDTPLIEAYL ARRALDYLVG FTLSPVLWRK LPGSKSAGRV QSVALRLICE REAEIELFRP REYWTIEAEF RTPAGAPFRA RLTHLDGRKL DQFDLADKAA ADAAKAAVER GAFSVVSIEK KRVRRNPPPP FTTSTLQQDA SRKLHMTAQA IMRTAQQLYE GVEIGGETVG LITYMRTDGV QMAREAITAA RQQVKDAFGA NYLPAAPREY QSKAKNAQEA HEAIRPTDFT LTPERAARHL SPEQARLYEL IYKRALASQM QSAELDQTTV ELADAAGTTL RATGSILAFD GFLKLYREGM DEDPEDETAR LLPPMAKSDP LASGPVTAEQ HFTQPPPRYS EATLVKRMEE LGIGRPSTYA SILTVLRERN YVRMENRRFI PEDRGRLVTA FLVSFFGRYV ETGFTADLEE KLDEVSDGRL DWRSVMRDFW ADFSGAVEQI KDLKISDVID ALDEELGPHF FPAREDGSDP RVCQACGTGR LGLRLGRFGA FIGCSNYPEC QYTRRLAVEG GEDEGDQLRD GMKLLGENAE GIPVTLRRGP YGLYVQLGEP DPEDKKAKPR RAALPRGMSG ETITLEQAIG LLSLPRVIGV HPETKQEIQA GIGRFGPYVK MGPIFASLDK DDDVLAVGLN RAVMVLAKKQ EGIRDLGPHP KDGESVTARK GRFGPYVQHG KTVATLPRGS ELGAVTLDEA VALLAEKGKT LAPRGRKGAK PAAKPKDAAA PKAARPAKAK AAAKPKAAAA KPEAAAAKAK PAARKTARSG G
|
| |