Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2656 |
Symbol | |
ID | 5734536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3406156 |
End bp | 3409779 |
Gene Length | 3624 bp |
Protein Length | 1207 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279798 |
Product | transcription-repair coupling factor |
Protein accession | YP_001545422 |
Protein GI | 159899175 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0357057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGATC ATGTTGATAC GTTGCTAGCA GATATTCAGG CGCTCTTTGC CAGCGAGCCG CAGCTTAAAG CAGCCTTAGC AGCAGCCCAA CCCAATACAA CCCGCACAAT CACGCCTGTA CCAAGTGCAG CACGCCGCGT CGTCGCCGCC ATCGCGACCC GGCAACATCG GAGCAGTTTG TTGATTGCTG CTACACCTGA TGCCGCAGTA CGCATGCACG CCGACCTAAC CGCATGGCTC GGCGAGAACG TCATGCTCTT TCCACCGACC GATACGCTGC CCTACGAGCA TATGTCGGCG GATATTGGGA TTGTAGCGCA ACGTTTGGCA GTGCTCGGGC GCTTGCATGC TAGCGAACCG ATCTGCGTAG TCGCTTCGGT CAAAGCCTTG ATGCAACCCA CCATGACCCC TGAAGAATTT CAGTTTGCCA CGCGCTTGTT GCGCCAAGGC GACCAGCATG ATCCACGCAA GTTGCTCGCT CACTGGGTCA GTTTGGGCTA TCGGGTTGGC CCAACCGCCG AACAACCAGG CGATCTCAGC CAGCGCGGCG GGATTATCGA TATTTTTCCG CCAACCAGCG ATCGCCCAAT TCGGCTCGAA TTTTGGGATG ATCAGCTCGA AAGTTTGCGT ATTTACGACC CAATCTCGCA GCGCTCAGAT AAACGGGTGC GCCAAATTCA AATCAGCCCA GCCCATGAAA TTCCGTTTTG GCGACGAACC GAGGCGATCA AACGGATCGA GCAGTTGCAA ATTGCCAGCT TGCGCCGCGA AGTACAGCAC GAATGGGCCA CCGCCCGCGA ACATCTTGAA ACAGGCCAGC GCTTTGAAGG GCGGGCCTTT TATGCCCCGT TTTTTCGCAC GCCCCACGAA GCAGCCGAGG CTGGGCTATG GCAACATTTG CCAGCCTCAG CAATCATCCT GCTTTCCGAA GAGCATGAAT TGAATGCCCA AGGCATTGAA TTGCAAAGCC ATGCCGATTT GGTGCGCAGC ACTCAAATTG AAAATAATGA ATTACCGCCC GATTTTCCGT TTCCATTAAT CGCTTGGACG TTTATTCGGC GGTCGATTCA ACGCTGGAGT TGCCTCAACC TCAGCAATCA GCCCATCGCC GATGATCAAA ACGATAGCTT TGTGCATGAA ATTAATACAT TTAGCCAAGC TGCATCCTAT GGCGGCCAAA CTGATCGGCT GTTTGAGGAT CTGCGCGAAC GCTTGGTTGG TGGCGAACGA GTGGTGTTGA TTTCACCACA GGCTAGTCGG CTGCGCGAAT TGGCTAGCCA ACACAATTTA GCCCTAATTG GCGAAGAAGA CGGCCCCGAT GTTGACGATC CGCCATTTCA AAGCGGCACG ATCATCATTC GCCATGGCAA TTTATCAGGT GGGTTTAGCA GCGATCCATT GCGCTTAACG ATTTTAAGCG ATAGTGAGAT TTTTGGCTGG CAGCAACGCC GAGCACTTTC AACTCGTGCC CGCAAACAGC GCACCGAAAG CGATCGCACG GCCTTTTTGC AATCGCTCAA AGTTGGCGAT TATGTGGTGC ATATCGAGCA TGGCATCGCT CAATACGAGG GTTTATCACG GATCGAGGCT AGCGGCGCTG AACGCGAGTT TTTGGTGCTG CGCTACGCTT CGGGCGATAA ATTATATGTG CCGGTCGATC AGGTCGATCG GGTATCACGC TATATCGGCG CAGGCGAAGG CAAGCCAACC CTCACCCGTT TGGGCACCTC AGATTGGGAG CGTGCCAAGC GCAAAGTGCG GGCCGATGTT GAAGAACTCG CGACTGAATT GCTCGATTTA TACGCAGCTC GTCAGTTAGT CGAGGGCTTT GCCTATAGCA GCGACACCTC GTGGCAACGC GAACTCGAAG ATAGTTTTCC CTATACCGAG ACCGACGATC AACTGCGGGC AATCGAAGAA GTCAAAAGCG ACATGGAAAA TACCCGCCCA ATGGATCGGC TGATCTGTGG CGATGTGGGC TTTGGCAAAA CTGAAGTTGC CTTACGCGCC GCCTTCAAAG CGGTGCAAGA TGGCCGCCAA GTTGCGGTGC TGGTACCAAC CACCGTCCTA GCCCAACAAC ATTTTGAAAC CTTCTCGCGG CGCATGCAAA TGTTTCCGGT ACGGATCGAA ATGCTTTCAC GCTTTCGCTC AGCATCACAG CAAAAATCAA TCACCGAACG GATCGTCAAG GGCGAAATTG ATATTGTGGT TGGCACGCAT CGGATTTTAT CCAGCGATAT TCATTTTAAG CAACTCGGCT TGGTGATTAT CGACGAAGAA CAACGCTTCG GCGTTAAAGA TAAAGAGCGA CTTAAAAAAT TGCGCCATGA AATTGATGTG CTGACATTAA CTGCCACGCC GATTCCGCGC ACAATGCACA TGGCTTTGTC GGGTATTCGC GATTTAAGCG TGATCGACAC ACCGCCCGAC GATCGCATGC CAATCAAAAC CTATGTGCAG CCCTACAACG AAATGTTGGT ACGCGATGCG ATTTTGCGCG AATTAGGGCG TAATGGTCAA GCCTATTTTG TGCATAATCG GGTGCAATCA ATTTATACTG TCGCCAATCG GCTGCAAAAA CTCGTGCCCG AAGCCCGAAT CGGGGTTGGC CATGGCCAAA TGCCCGAAAA AGCGCTCGAA AAAGTTATTT TGCAATTTTT CGAAGGCTTG TTTGATGTGT TTGTGTGTAC GACGATTATC GAAAGCGGCA TCGACGTGCC CAGCGCCAAC ACCATGATTA TTGATGATGC AACAACCTAT GGACTAGCCC AACTCTATCA ATTGCGGGGG CGAGTTGGTC GCTCAACCCA GCGCGGTTAT GCCTATATGT TCTACAACCC CACCAAAGCC ATGGGCGAAG AAGCGCAAAA GCGGCTTGAG GCGATTCAAG AGGCCACCGA ACTTGGGGCA GGCTTTCGCA TCGCCATGCG CGACCTAGAA ATTCGTGGCA CTGGCAATTT GCTTGGAGCT GAACAATCGG GCAATATCAC CACGATTGGC TTTGATCTCT ACTCGCGCTT GCTTTCGCAG GCAGTCGAGC GCGTGCGTGA AGAACGCAAA CGCGGCCAAC AGCAAAAATC TGGCGAACAA AAAGCCCAAC GCGCTCGTGC AGTCGAGGCC TTGCGCCGAG CAGCAGTGGT TTCAGCTCGC AGCAGTTTTA GCGGCGATGC CGATGATCCA GTCTTGCCCG ATGCGATGGT CAGCATTGAT CTGCCGATTA ATGCCTACTT GCCGCAAAAT TATGTTGACG ATGAACCGCT GCGTTTGCGC GTCTATCAAC ATATTGCCGA AGCACGTTCA ACCCGCGATA TTCGTATGTT GCGCCAAGAA TTAGAAGATC GCTTTGGACC TGTGCCAGAG CCAGCCGCCC GCCTGCTCGA TTTGCTGACA ATCAAAGTTT TAGCCTTACA AGCTGGCGTT ATTTCGATCA TTTCTGATGA TAATGAAATT ACTGTGCGCT TGCCCAAGAG CGTCTATCTT GATCGCGAGC AACTTCAGCG CGAATCACCA CGTGGCGTAG TAATTGGCCC GCAATTGGCG CGGCTTGATC GGCGGGTATT GCGCGATGAT TGGGAAGTTG TGTTGCGGCA GTTGCTCGAA GCACTCAACA AAATTGGCAA TTAA
|
Protein sequence | MTDHVDTLLA DIQALFASEP QLKAALAAAQ PNTTRTITPV PSAARRVVAA IATRQHRSSL LIAATPDAAV RMHADLTAWL GENVMLFPPT DTLPYEHMSA DIGIVAQRLA VLGRLHASEP ICVVASVKAL MQPTMTPEEF QFATRLLRQG DQHDPRKLLA HWVSLGYRVG PTAEQPGDLS QRGGIIDIFP PTSDRPIRLE FWDDQLESLR IYDPISQRSD KRVRQIQISP AHEIPFWRRT EAIKRIEQLQ IASLRREVQH EWATAREHLE TGQRFEGRAF YAPFFRTPHE AAEAGLWQHL PASAIILLSE EHELNAQGIE LQSHADLVRS TQIENNELPP DFPFPLIAWT FIRRSIQRWS CLNLSNQPIA DDQNDSFVHE INTFSQAASY GGQTDRLFED LRERLVGGER VVLISPQASR LRELASQHNL ALIGEEDGPD VDDPPFQSGT IIIRHGNLSG GFSSDPLRLT ILSDSEIFGW QQRRALSTRA RKQRTESDRT AFLQSLKVGD YVVHIEHGIA QYEGLSRIEA SGAEREFLVL RYASGDKLYV PVDQVDRVSR YIGAGEGKPT LTRLGTSDWE RAKRKVRADV EELATELLDL YAARQLVEGF AYSSDTSWQR ELEDSFPYTE TDDQLRAIEE VKSDMENTRP MDRLICGDVG FGKTEVALRA AFKAVQDGRQ VAVLVPTTVL AQQHFETFSR RMQMFPVRIE MLSRFRSASQ QKSITERIVK GEIDIVVGTH RILSSDIHFK QLGLVIIDEE QRFGVKDKER LKKLRHEIDV LTLTATPIPR TMHMALSGIR DLSVIDTPPD DRMPIKTYVQ PYNEMLVRDA ILRELGRNGQ AYFVHNRVQS IYTVANRLQK LVPEARIGVG HGQMPEKALE KVILQFFEGL FDVFVCTTII ESGIDVPSAN TMIIDDATTY GLAQLYQLRG RVGRSTQRGY AYMFYNPTKA MGEEAQKRLE AIQEATELGA GFRIAMRDLE IRGTGNLLGA EQSGNITTIG FDLYSRLLSQ AVERVREERK RGQQQKSGEQ KAQRARAVEA LRRAAVVSAR SSFSGDADDP VLPDAMVSID LPINAYLPQN YVDDEPLRLR VYQHIAEARS TRDIRMLRQE LEDRFGPVPE PAARLLDLLT IKVLALQAGV ISIISDDNEI TVRLPKSVYL DREQLQRESP RGVVIGPQLA RLDRRVLRDD WEVVLRQLLE ALNKIGN
|
| |