Gene Haur_4389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4389 
Symbol 
ID5736239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5606211 
End bp5608592 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content50% 
IMG OID641281551 
ProductDNA topoisomerase I 
Protein accessionYP_001547149 
Protein GI159900902 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00166014 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCATA AATTGGTGAT TGTTGAGTCG CCAGCCAAAG CAAAAACTAT TCAAAAATAT 
CTGGGTGCTG GCTATCGCGT TATGGCGAGT ATGGGCCATG TGCGTGATTT GCCCAAGAGT
AAAATTGGCA TTGATATTGA CAATGATTTT AGCCCTGTCT ATGAAATTAG CGAGGGCAAA
GATAAGCTCA TTGCCGAATT GAAGCGCGAA ATCAAGACTG CTGATGCTAT TTACCTCGCA
ACCGACCACG ACCGCGAAGG CGAGGCGATC GCTTGGCATA TTTTACAAGC AGCTAATATT
GGCAAACGTA AGCCAGTCTA TCGCATTACC TTTAACGAAA TTACCAAGGA TTCGATTCAG
CACGCCATTC GCAATCCGCG CGAAATCGAC GCAAACCTAG TTGATGCCCA ACAGGCACGA
CGGGTGCTTG ATCGCTTGGT TGGCTATAAA ATTTCACCAA TTTTGTGGGC CAAGGTGCGG
CGCGGGCTTT CGGCTGGGCG GGTGCAATCG GTTGCTGTGC GTATGGTGGT TGAGCGTGAA
CGCGAAATCG AAAGCTTCGT GCCCAAAGAA TATTGGACGA TCGAGGCCGA TTTATCGCCA
GCTGGGCTGA AGAAACTTGG CAAGCACGAT ATTTTTCGCG CAATCTTGCA TGCTCGCAAT
GGCAAAAAGC TCGATAAATT TGCGATTCCT AGCAAAGATG CTGCTGATGC AGTTTTGGCT
GCCTTGGAAG GGGCAAATTA TCTTGTTGGC ACTGTAACCC GCAAGGATAA ACGGCGCTCG
CCAGCCCCGC CATTTATCAC CAGTACCCTG CAACAAGAAG CTAGCCGCAA GCTTGGGTTT
AGCTCCAAAC GCACTATGCA AGTGGCCCAA AAACTGTATG AAGGGGTCGA TATTGGTGGC
AAAGATGGTA CAGTTGGTCT GATTACCTAT ATGCGTACCG ATTCAACCAA CGTTTCAGTC
GATGCCCAAA CCGAGGCTCG TACACTGATT ACTGAACTCT ATGGCAAAGA GTACGTTCCA
GCTAAGCCCA ATATCTACAA AACCAAGGCT AAAGGTGCGC AGGAAGCTCA CGAGGCGATC
CGCCCAACCA GCGTAGTCCG CCGCCCTGAT CAATTAAAAA CAGCGCTTGG CCGCGATGAG
TTTCGGCTTT ACGACTTGAT CTGGAAGCGC TTTATGGCTT CGCAGATGGC GGCGGCAATT
TTTGATAGTA CGAGCGTCGA TATTGGTGCT GGGGCCGGAA TCAAAACTGC TGCTGGAGCG
CCATTTACCT TCCGCGCAAC TGGCTCAGTG CTCAAATTCA ATGGCTTTTT GGCAGTCTAC
AACGTCAGCC TCGATGAAGG CGACGAAGAT GAAGACAAAG AGGCCTTGTT GCCGCCGCTC
AACGAAGGCC AAGCCCTCGA TTTGCATGAT CTCTTTGGCG AGCAACATTT CACTACGCCA
CCACCACGCT ATACCGAAGC AACCTTGGTT AAGCAGATGG AAAGTGAAGG GATTGGTCGC
CCATCAACCT ATGCGCCGAC GATCTCAACC ATCGTTGCCC GCGAATATGT TGAGTTGGTC
GAAAAGAAAT TGATGCCCAC CACCTTAGGC CGGGTTGTGA CCGACTTGCT AGTTGAGCAC
TTCAAAGATA TTGTCGATTA CAACTTTACT TCGGATATGG AACAGCGGCT TGATGATATC
GCTGAAGGCC AACGTCGTTG GGTGCCAGTG CTGCGCGAAT TTTACGATCC GTTTGCTGTA
CGCTTGCATG CTGCCGAAAC CGAAATGCGC AACGTCAAGC GCGAAGAAAT CAAAACTGAA
TTGCCCTGCC CGCAATGTGG CACGCATCTG GTGATCAAAT GGGGGCGTAA TGGCGAATTC
TTGGCTTGTT CGCGTTACCC TGAGTGTAGC TGGACGGGTG ATTTAGAGCG TGATGGCGAT
GGCGGGATTC ATATCGCCTC GCAGCCTGAA ATTTTTGGCA ACACCAATTG CCCTGAATGC
GACAACCCAA TGAGCCTCAA AAAAGGTCGT TTTGGGCCGT TTCTCGCCTG TAACAACTAC
CCAACTTGTA AGGGTATTCG CAAAGTACGG GCGCAAGGCA AAGATTTTGT GGTGATTCCG
CCGCCTAAGC CAACCGAGGA AAAATGCCCC AAATGTAACC GCCCAATGGT GCAAAAAGAG
GGCAAATTTG GGCCATTCTT ATCATGCACC GGCTACCCTG AATGCCGCTC GATTGTACGA
TTAACGCCCA ACGATGCCCC AACCTGCCCT CAATGTGGTG AGGGTAAAGT TGTCGCCAAA
CGTGCGCGTG GTGGCCGTAC CTTCTTCTCT TGCACACGCT ACCCCGATTG TACGTATGCG
AGTAACGCCT TACCTGTGGC GGTTTCCGAA GAAGTAGCCT AA
 
Protein sequence
MDHKLVIVES PAKAKTIQKY LGAGYRVMAS MGHVRDLPKS KIGIDIDNDF SPVYEISEGK 
DKLIAELKRE IKTADAIYLA TDHDREGEAI AWHILQAANI GKRKPVYRIT FNEITKDSIQ
HAIRNPREID ANLVDAQQAR RVLDRLVGYK ISPILWAKVR RGLSAGRVQS VAVRMVVERE
REIESFVPKE YWTIEADLSP AGLKKLGKHD IFRAILHARN GKKLDKFAIP SKDAADAVLA
ALEGANYLVG TVTRKDKRRS PAPPFITSTL QQEASRKLGF SSKRTMQVAQ KLYEGVDIGG
KDGTVGLITY MRTDSTNVSV DAQTEARTLI TELYGKEYVP AKPNIYKTKA KGAQEAHEAI
RPTSVVRRPD QLKTALGRDE FRLYDLIWKR FMASQMAAAI FDSTSVDIGA GAGIKTAAGA
PFTFRATGSV LKFNGFLAVY NVSLDEGDED EDKEALLPPL NEGQALDLHD LFGEQHFTTP
PPRYTEATLV KQMESEGIGR PSTYAPTIST IVAREYVELV EKKLMPTTLG RVVTDLLVEH
FKDIVDYNFT SDMEQRLDDI AEGQRRWVPV LREFYDPFAV RLHAAETEMR NVKREEIKTE
LPCPQCGTHL VIKWGRNGEF LACSRYPECS WTGDLERDGD GGIHIASQPE IFGNTNCPEC
DNPMSLKKGR FGPFLACNNY PTCKGIRKVR AQGKDFVVIP PPKPTEEKCP KCNRPMVQKE
GKFGPFLSCT GYPECRSIVR LTPNDAPTCP QCGEGKVVAK RARGGRTFFS CTRYPDCTYA
SNALPVAVSE EVA