Gene Aasi_1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAasi_1150 
Symbol 
ID6376965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Amoebophilus asiaticus 5a2 
KingdomBacteria 
Replicon accessionNC_010830 
Strand
Start bp1467801 
End bp1470143 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content39% 
IMG OID642682256 
Producthypothetical protein 
Protein accessionYP_001958215 
Protein GI189502498 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGA ATTTAGTCAT TGTTGAGTCA CCTGCTAAAG CCAAAACCAT CCAAAATTAT 
TTAGGAAAAG ACTATGAAGT TGCTTCTTCC AATGGGCATG TACGCGATTT ACCTAAACAT
AATAAGGCTA TTGATATTGA GCAAGACTTT CAACCTACTT ATGAGATTAC TAAAGGTAAG
CAACAGCTTA TAAAAGAACT AAAGAAGAAA GCCAACAACG CACAAAAAGT TTATTTAGCT
AGCGATGATG ATCGAGAAGG AGAAGCTATT TCCTGGCATC TAAAAGAGGC ACTCGACTTA
GTGGATAATA AAACACAAAG AATTGTTTTT AGAGAGATTA CTAAAAATGC TATACTAAAT
GCTATAAAAA ACCCTAGAAA TATTGACCTG GCATTAGTAA ACTCCCAGCA AGCACGGCGT
ATTCTAGATA GGTTGGTAGG ATATGATTTA TCTCCATTGC TGTGGAAAAA GATAAAGCCT
GGCCTTTCTG CAGGGCGGGT GCAATCAGTT GCTGTTAGAA TGATTGTAGA ACGAGAGCGT
GCTATACAGC ATTTTGTTGC TACTAACTAC TTTGTTGTAA CGGCATTATT TGATTTGGGT
AACAAGCAAC GTCTGTTAGC AGAACTTTCT GATCGTTTGC AAAATGAAAA AGAAGCCCAC
CAATTTTTAG AGAAATGTAT AGGTACAACC TTTTCTATTC AAAATTTAGA TAAAAAGCTA
GCTAAAAGGT CACCTGCTCC CCCTTTTACC ACTTCTACTT TACAGCAAGA AGCCAGCCAA
AAGTTTGGCT ATGGCGTTAC ACGTACCATG TTACTAGCAC AAAATCTGTA TGAAGCTGGT
AAGATTTCTT ATATGCGTAC AGATTCTGTG ACGCTTTCGC AAGAGGCTAT TCAAAGTGCA
CAGCAAGAAA TAGCAAGCAA TTATGGTAGT ACTTATTTTC AAGAAAGGCA TTACAAAACA
AAATCTGCTT CGGCACAAGA AGCACATGAA GCCATTAGGC CCACTGACTT TTCTCAACGT
GTTGTAAGTC AAGACGCTAG TGAGCAGCGC TTATATGAAT TAATTTGGAA ACGTGCTATT
GCCTCACAAA TGGCCGATGC TCAATTGGAA AGGACTACCG CACATATTGC TATTTCTACT
ACCCCTCAAC AGCTTGTGGC ACATGGTGAG GTTGTTAAGT TTGATGGCTT CCTAAAAGTA
TATGCAGTTA AACATGATGA AGACGATGAA GCAGACGATC AAGCAGACGA TCAAGCACAA
GGCAGTAAGC TATTACCACC ATTAAAAGTA GGACAACTTC TTCCATTAGA GGATATGCAA
GCGCGTGAGC GTTTTACCAA GCCTCCTGCT AGATATTCTG AAGCTAGCTT GGTTAAGCAG
CTCGAAGAGC AGGGGATTGG ACGCCCCTCT ACTTATGCGC CTACTATTAC TACTATACAA
CAACGCGGTT ATGTAGTTAA AGAATCAAAA GAAGGGAAGG AGCGTAACTA TCAGGTGTTG
ACGCTAAAAG ATGATGCTAT AAAAAAAGAG ATATTTAAGG AAATAACAGG CACAGAAAAA
AATAAGTTGT TCCCTACTGA TATTGCCATG GTAGTGAATG ACTTTTTGGT AGAGCGCTTT
TCCGAAGTAA CTGATTATGG TTTTACTGCT AAAGTAGAGT CTGAGTTAGA TGAGGTAGCT
ACAGGCCATA AAGAGTGGAA CCAGATGTTG GCTGAGTTTT ATAAGAATTT TTATCCCAAA
GTAGCAGAAA CACAACAAGT AGAGCGGACA GCCATTAATA CAAGTAGGAT GTTGGGACAT
GATCCTGTAA GTGGTAAGCC TGTTACTGCA CGTATTGGAA GATTTGGTCC ATTAGTGCAA
ATAGGCAACA ACGAGGAAGA TGAGGCCCCA CGTTTTGCAA GCCTTAAAAA AGATCAGCGT
ATAGAAAGTA TTACCTTAGA AGAAGCACTT ACGCTCTTCA AGTTTCCAAG AACAGTAGGG
CAATGGGAAA ACTTACCTAT AGAAGCGAGT ATAGGCAGGT TTGGTCCTTA TTTGAAATAT
CAAAATCAAT TTTATACCCT TCCCAAAGAA GAGGACCCGC TTACAGTTAC GGAAGAGCGT
GCCATACAGA TTATACAAGA CAAGCAAAAA GCAGATGCAG AAAAAGTAAT TAAAACCTTT
CCAGAGGATG TAAATATGCA AATACTTAAT GGTCGCTGGG GACCTTATCT AAAGGTAGGA
AAGGTAAATG TTAAGTTACC TAAAGACGTA GATCCTAAAA AGTTAAGCTT TGCAGAATGT
CAAAAATTAG CAGAAAGCGC CATGCCAATT GCTACTGCAA AAAAGAAATC TGCTCGTAGA
TAG
 
Protein sequence
MSKNLVIVES PAKAKTIQNY LGKDYEVASS NGHVRDLPKH NKAIDIEQDF QPTYEITKGK 
QQLIKELKKK ANNAQKVYLA SDDDREGEAI SWHLKEALDL VDNKTQRIVF REITKNAILN
AIKNPRNIDL ALVNSQQARR ILDRLVGYDL SPLLWKKIKP GLSAGRVQSV AVRMIVERER
AIQHFVATNY FVVTALFDLG NKQRLLAELS DRLQNEKEAH QFLEKCIGTT FSIQNLDKKL
AKRSPAPPFT TSTLQQEASQ KFGYGVTRTM LLAQNLYEAG KISYMRTDSV TLSQEAIQSA
QQEIASNYGS TYFQERHYKT KSASAQEAHE AIRPTDFSQR VVSQDASEQR LYELIWKRAI
ASQMADAQLE RTTAHIAIST TPQQLVAHGE VVKFDGFLKV YAVKHDEDDE ADDQADDQAQ
GSKLLPPLKV GQLLPLEDMQ ARERFTKPPA RYSEASLVKQ LEEQGIGRPS TYAPTITTIQ
QRGYVVKESK EGKERNYQVL TLKDDAIKKE IFKEITGTEK NKLFPTDIAM VVNDFLVERF
SEVTDYGFTA KVESELDEVA TGHKEWNQML AEFYKNFYPK VAETQQVERT AINTSRMLGH
DPVSGKPVTA RIGRFGPLVQ IGNNEEDEAP RFASLKKDQR IESITLEEAL TLFKFPRTVG
QWENLPIEAS IGRFGPYLKY QNQFYTLPKE EDPLTVTEER AIQIIQDKQK ADAEKVIKTF
PEDVNMQILN GRWGPYLKVG KVNVKLPKDV DPKKLSFAEC QKLAESAMPI ATAKKKSARR