Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1150 |
Symbol | |
ID | 6376965 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1467801 |
End bp | 1470143 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642682256 |
Product | hypothetical protein |
Protein accession | YP_001958215 |
Protein GI | 189502498 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGA ATTTAGTCAT TGTTGAGTCA CCTGCTAAAG CCAAAACCAT CCAAAATTAT TTAGGAAAAG ACTATGAAGT TGCTTCTTCC AATGGGCATG TACGCGATTT ACCTAAACAT AATAAGGCTA TTGATATTGA GCAAGACTTT CAACCTACTT ATGAGATTAC TAAAGGTAAG CAACAGCTTA TAAAAGAACT AAAGAAGAAA GCCAACAACG CACAAAAAGT TTATTTAGCT AGCGATGATG ATCGAGAAGG AGAAGCTATT TCCTGGCATC TAAAAGAGGC ACTCGACTTA GTGGATAATA AAACACAAAG AATTGTTTTT AGAGAGATTA CTAAAAATGC TATACTAAAT GCTATAAAAA ACCCTAGAAA TATTGACCTG GCATTAGTAA ACTCCCAGCA AGCACGGCGT ATTCTAGATA GGTTGGTAGG ATATGATTTA TCTCCATTGC TGTGGAAAAA GATAAAGCCT GGCCTTTCTG CAGGGCGGGT GCAATCAGTT GCTGTTAGAA TGATTGTAGA ACGAGAGCGT GCTATACAGC ATTTTGTTGC TACTAACTAC TTTGTTGTAA CGGCATTATT TGATTTGGGT AACAAGCAAC GTCTGTTAGC AGAACTTTCT GATCGTTTGC AAAATGAAAA AGAAGCCCAC CAATTTTTAG AGAAATGTAT AGGTACAACC TTTTCTATTC AAAATTTAGA TAAAAAGCTA GCTAAAAGGT CACCTGCTCC CCCTTTTACC ACTTCTACTT TACAGCAAGA AGCCAGCCAA AAGTTTGGCT ATGGCGTTAC ACGTACCATG TTACTAGCAC AAAATCTGTA TGAAGCTGGT AAGATTTCTT ATATGCGTAC AGATTCTGTG ACGCTTTCGC AAGAGGCTAT TCAAAGTGCA CAGCAAGAAA TAGCAAGCAA TTATGGTAGT ACTTATTTTC AAGAAAGGCA TTACAAAACA AAATCTGCTT CGGCACAAGA AGCACATGAA GCCATTAGGC CCACTGACTT TTCTCAACGT GTTGTAAGTC AAGACGCTAG TGAGCAGCGC TTATATGAAT TAATTTGGAA ACGTGCTATT GCCTCACAAA TGGCCGATGC TCAATTGGAA AGGACTACCG CACATATTGC TATTTCTACT ACCCCTCAAC AGCTTGTGGC ACATGGTGAG GTTGTTAAGT TTGATGGCTT CCTAAAAGTA TATGCAGTTA AACATGATGA AGACGATGAA GCAGACGATC AAGCAGACGA TCAAGCACAA GGCAGTAAGC TATTACCACC ATTAAAAGTA GGACAACTTC TTCCATTAGA GGATATGCAA GCGCGTGAGC GTTTTACCAA GCCTCCTGCT AGATATTCTG AAGCTAGCTT GGTTAAGCAG CTCGAAGAGC AGGGGATTGG ACGCCCCTCT ACTTATGCGC CTACTATTAC TACTATACAA CAACGCGGTT ATGTAGTTAA AGAATCAAAA GAAGGGAAGG AGCGTAACTA TCAGGTGTTG ACGCTAAAAG ATGATGCTAT AAAAAAAGAG ATATTTAAGG AAATAACAGG CACAGAAAAA AATAAGTTGT TCCCTACTGA TATTGCCATG GTAGTGAATG ACTTTTTGGT AGAGCGCTTT TCCGAAGTAA CTGATTATGG TTTTACTGCT AAAGTAGAGT CTGAGTTAGA TGAGGTAGCT ACAGGCCATA AAGAGTGGAA CCAGATGTTG GCTGAGTTTT ATAAGAATTT TTATCCCAAA GTAGCAGAAA CACAACAAGT AGAGCGGACA GCCATTAATA CAAGTAGGAT GTTGGGACAT GATCCTGTAA GTGGTAAGCC TGTTACTGCA CGTATTGGAA GATTTGGTCC ATTAGTGCAA ATAGGCAACA ACGAGGAAGA TGAGGCCCCA CGTTTTGCAA GCCTTAAAAA AGATCAGCGT ATAGAAAGTA TTACCTTAGA AGAAGCACTT ACGCTCTTCA AGTTTCCAAG AACAGTAGGG CAATGGGAAA ACTTACCTAT AGAAGCGAGT ATAGGCAGGT TTGGTCCTTA TTTGAAATAT CAAAATCAAT TTTATACCCT TCCCAAAGAA GAGGACCCGC TTACAGTTAC GGAAGAGCGT GCCATACAGA TTATACAAGA CAAGCAAAAA GCAGATGCAG AAAAAGTAAT TAAAACCTTT CCAGAGGATG TAAATATGCA AATACTTAAT GGTCGCTGGG GACCTTATCT AAAGGTAGGA AAGGTAAATG TTAAGTTACC TAAAGACGTA GATCCTAAAA AGTTAAGCTT TGCAGAATGT CAAAAATTAG CAGAAAGCGC CATGCCAATT GCTACTGCAA AAAAGAAATC TGCTCGTAGA TAG
|
Protein sequence | MSKNLVIVES PAKAKTIQNY LGKDYEVASS NGHVRDLPKH NKAIDIEQDF QPTYEITKGK QQLIKELKKK ANNAQKVYLA SDDDREGEAI SWHLKEALDL VDNKTQRIVF REITKNAILN AIKNPRNIDL ALVNSQQARR ILDRLVGYDL SPLLWKKIKP GLSAGRVQSV AVRMIVERER AIQHFVATNY FVVTALFDLG NKQRLLAELS DRLQNEKEAH QFLEKCIGTT FSIQNLDKKL AKRSPAPPFT TSTLQQEASQ KFGYGVTRTM LLAQNLYEAG KISYMRTDSV TLSQEAIQSA QQEIASNYGS TYFQERHYKT KSASAQEAHE AIRPTDFSQR VVSQDASEQR LYELIWKRAI ASQMADAQLE RTTAHIAIST TPQQLVAHGE VVKFDGFLKV YAVKHDEDDE ADDQADDQAQ GSKLLPPLKV GQLLPLEDMQ ARERFTKPPA RYSEASLVKQ LEEQGIGRPS TYAPTITTIQ QRGYVVKESK EGKERNYQVL TLKDDAIKKE IFKEITGTEK NKLFPTDIAM VVNDFLVERF SEVTDYGFTA KVESELDEVA TGHKEWNQML AEFYKNFYPK VAETQQVERT AINTSRMLGH DPVSGKPVTA RIGRFGPLVQ IGNNEEDEAP RFASLKKDQR IESITLEEAL TLFKFPRTVG QWENLPIEAS IGRFGPYLKY QNQFYTLPKE EDPLTVTEER AIQIIQDKQK ADAEKVIKTF PEDVNMQILN GRWGPYLKVG KVNVKLPKDV DPKKLSFAEC QKLAESAMPI ATAKKKSARR
|
| |