Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2619 |
Symbol | |
ID | 5900074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2841693 |
End bp | 2845160 |
Gene Length | 3468 bp |
Protein Length | 1155 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641563110 |
Product | transcription-repair coupling factor |
Protein accession | YP_001684244 |
Protein GI | 167646581 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.205951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACG ACGCCAAACA GATCGCCAAG GCCGCCGGGG GCCTGACCCT GGCCGGGGCG CCGGAAGGCT TCGACGCGCT GGTCATGGCC GACATCGCCC GCGCGCGCGG CGGCCTGACC GCCTTCGTGG CCCGCGACAC CGCCCGGGCC AGCGCCTTCA TCGACGCCCT GAAGTTCTTC GCCCCCGAGA TCGAGGCGGT GCTGTTCCCG TCCTGGGACT GCCTGCCTTA CGACCGCATC GGCCCTTCGG CCGGCGTCGC CGCCACGCGG ATGGCCACCC TGCACCGCCT GGCCGAGGGC CTGGGCAAGG ACAAGGCGGC CATCCTGGTC ACCGCCGCGC CGGCCCTGCT GCAGCGCGTG CCCGAGAAGA CCGTGCTGCT GCGCGCCAGC TACGGCGCCA AGGTCGGCAA CGTCGTCGAT GTGGCGGACC TGGAGCGCTA TTTCGCGATC AACGGCTATG TCCGCGCCTC CACCGTGTCA GAGCGCGGCG AGTTCGCGAT CCGCGGCGGC GTGATCGACG TCTATCCGCC GGCCGCCGAG GAGCCGGTGC GCCTGGACCT GTTCGGCGAC ACCCTGGAAA GCATCCGCGC CTTCGATCCG GAAACCCAGC GCTCGACCAA GCAACTGCGT GAGATCCAAC TTCTGCCGGT CAGCGAGGCC CTGCTCGACA AGGAGGGCAT TTCGCGGTTC CGCTCGGGTT ACATCCGCGA GTTCGGCGCG CCGGGCGATG ACGCGCTGTA CGCCACGGTC AGCGAGGGCG GCCGCCGCGC CGGGCTCGAA CACTGGCTGC CGCTGTTCTA CGAGCGGATG GCGACCCTGT TCGACTATCT GCCCGACGGA GCCCTGGTCG GCATCGACCA CCAGGTGACC GAGAGCCGCG ACGAGCGCCT GGCGATGATC GCCGACGCCT ATGACGCTCG GGCCTCGGCC GACCGCAAGT CGGCCTATCG CCCGCTGAAG CCGGACGCCC TCTACCTGAC CGCCGCCGAG TGGGACCAGG CCGTCGAGGA CCGGACCCAC CGCAAGTTCA CGCCGTTCCA GCCCCAGGGT CTCGACGTCG TCGACATGGG CGCCAAGCTC GGACGCACCT TCGCCGCCGA GCGCGCCCAG GACAGCGTCA ACCTGTTCCA GGCCACAGCC GATCACGCCA CCACGCTAGC GGGGCAGGGC AAGCGCGTGC TGTTCGCCTC GTGGTCCGAG GGCTCGTCCG AGCGCCTGGG CATGATGCTG GCCGACCACG GTATTAAAAA GCTAAGCTTC GCCGCCTATT GGCAGGCCGC CAAGGCCGCC GATCCCAAGA TCCCGCAGCG CGTGGTCCTG CCGCTGGACA ACGGCTTCGA GACCGAGACC CTCGTCGTCA TCTCCGAGAC CGACATCCTC GGCGACCGCT TGGCCCGGCC GCGCAAGAAG CGCCGCGCCG CCAACTTCCT GGCCGAGGCC AGCGCCCTGA CGCCCGGCGA CCTGGTCGTC CACATCGACC ACGGCATCGG TCGCTATGAG GGGCTCAAGA CGCTCGACGT CCAGGGCGCG CCCCACGACT GCCTGGACCT GCTGTATGGC GGCGAGGCCA AGCTCTACCT GCCGGTCGAA AACATCGACC TGCTGACCCG CTACGGGACC GACGCCGAGA ACGTCCAGTT GGACAAGCTG GGCGGCGCGG CCTGGCAGGG GCGCAAGTCC AAGGCCAAGG AACGCCTGAG GGTGATGGCC GAGGGCCTGA TCCAGATCGC CGCGGCCCGC CAGTTGAAGT CGGTGGAAGA GACCGACCCG CCGCACGGGG TGTTCGACGA GTTCTGCGCG CGCTTCCCCT ATGAGGAGAC CGACGACCAG TTGTCGGCCA TCGCCGACGT TCTGGAAGAT CTCGGCTCGG GCAAGCCGAT GGACCGGCTG ATCTGCGGCG ACGTGGGTTT TGGCAAGACG GAAGTCGCCC TGCGCGCGGC GTTCGTGGTG GCCATGAGCG GCAAGCAGGT GGCCATCGTC TGCCCGACCA CGCTGCTGGC TCGCCAGCAC TACAAAACCT TCAAGGACCG CTTCCAAGGC TGGCCGGTCA AGGTCACGCG CCTGTCGCGC CTGGTGACCG GCAAGGAAGC CGCCGAGACC CGCGAGGGCC TGGCCAACGG CCAACTGGAG ATCGTGGTCG GCACCCACGC CATCCTGTCC AAGCAGGTCT CGTTCAAGGA CCTTGGCCTG GTCATCGTCG ACGAGGAGCA GCACTTCGGG GTCAAGCACA AGGAGAAGCT GAAAGAGCTT CGCGCCGACG TGCACATGCT GACCCTGACC GCCACGCCGA TCCCGCGCAC CCTGCAGATG GCGCTGTCGG GCATCCGCGA GATGTCGATC ATCGCCACCC CGCCGGTCGA TCGCCTGGCG GTGCGCACCT ATATCAGCCC CTTCGACCCG GTGACCCTGC GCGAGGCCCT GCTGCGCGAG AAGTATCGCG GCGGCCAGGC CTATTACGTG GTGCCGCGCA TCAAGGACCT GGAGGAGATC GAGAAGTTCC TTCGCACCCA AGTGCCTGAG GTCAAGTACG TGGTCGGCCA TGGCCAGATG GCCCCGACCC AGCTGGAAGA CGTGATGACG GCCTTCTACG AGGGCCAGTA CGACGTGCTG CTCGCGACCA CGATCGTGGA AAGCGGGCTC GATATCCCGT CGGCCAACAC CCTGATCGTC CATCGCGCCG ACATGTTCGG CCTGGCCCAG CTCTACCAGA TCCGCGGCCG TGTCGGTCGC TCCAAGGCCC GCGCCTACGC CTATCTGACC ACGCCCAACG AGAAGCAGAT CACCCTGTCG GCCGAGAAGC GCCTGAAGGT GCTGCAATCG CTGGACAGCC TGGGCGCCGG CTTTCAGCTG GCCAGCCACG ACCTGGACCA GCGCGGCGGC GGCAACCTGC TGGGCGACGA GCAGAGCGGC CACATCAAGG AGATCGGCGT CGAGCTGTAC CAGCAGATGC TGGAGGACGC CGTCGCCGAG CTGCGCGAGC GCCAGGGCGC CGAGGCCCTG ATCGAGGATC GCGGCTGGTC GCCGCAGATC AACACCGGCG CGGCCGTGAT GATCCCCGAC GACTACGTGC CCGACCTGAA CGTGCGCCTG TCGCTCTATC GCCGCCTGTC GGAAGCCGAA AAGGCCGCCG ACCGCGAAGC CCTGGCCGCC GAGCTGATCG ACCGCTTCGG CCCGCTGCCG CCCGAGACCG ACAGCCTGCT GAAGGTGGTC GCCATCAAGG GCCTGTGCCG CGAAGCCAAT GTCGCCAAGA TCGACGTCGG GCCCAAGGGC GCGGTCGCCA GCTTCCGCAA CGACATCTAC GCCAACCCGC TGGCGCTGAT GCAGTTCGTG GCCAAGAACA ACGTCGTCTG GAAGGTGCGT CCCGACCAGA AGGTGGTGAT CAAGGGCGAG TGGGACACGC CCGCCCGGCG GCTGGACGCG GCCGAGAAGA TCCTGGCGCA ACTGGTCAAG CTGGCCAAGA CGGCGTGA
|
Protein sequence | MAYDAKQIAK AAGGLTLAGA PEGFDALVMA DIARARGGLT AFVARDTARA SAFIDALKFF APEIEAVLFP SWDCLPYDRI GPSAGVAATR MATLHRLAEG LGKDKAAILV TAAPALLQRV PEKTVLLRAS YGAKVGNVVD VADLERYFAI NGYVRASTVS ERGEFAIRGG VIDVYPPAAE EPVRLDLFGD TLESIRAFDP ETQRSTKQLR EIQLLPVSEA LLDKEGISRF RSGYIREFGA PGDDALYATV SEGGRRAGLE HWLPLFYERM ATLFDYLPDG ALVGIDHQVT ESRDERLAMI ADAYDARASA DRKSAYRPLK PDALYLTAAE WDQAVEDRTH RKFTPFQPQG LDVVDMGAKL GRTFAAERAQ DSVNLFQATA DHATTLAGQG KRVLFASWSE GSSERLGMML ADHGIKKLSF AAYWQAAKAA DPKIPQRVVL PLDNGFETET LVVISETDIL GDRLARPRKK RRAANFLAEA SALTPGDLVV HIDHGIGRYE GLKTLDVQGA PHDCLDLLYG GEAKLYLPVE NIDLLTRYGT DAENVQLDKL GGAAWQGRKS KAKERLRVMA EGLIQIAAAR QLKSVEETDP PHGVFDEFCA RFPYEETDDQ LSAIADVLED LGSGKPMDRL ICGDVGFGKT EVALRAAFVV AMSGKQVAIV CPTTLLARQH YKTFKDRFQG WPVKVTRLSR LVTGKEAAET REGLANGQLE IVVGTHAILS KQVSFKDLGL VIVDEEQHFG VKHKEKLKEL RADVHMLTLT ATPIPRTLQM ALSGIREMSI IATPPVDRLA VRTYISPFDP VTLREALLRE KYRGGQAYYV VPRIKDLEEI EKFLRTQVPE VKYVVGHGQM APTQLEDVMT AFYEGQYDVL LATTIVESGL DIPSANTLIV HRADMFGLAQ LYQIRGRVGR SKARAYAYLT TPNEKQITLS AEKRLKVLQS LDSLGAGFQL ASHDLDQRGG GNLLGDEQSG HIKEIGVELY QQMLEDAVAE LRERQGAEAL IEDRGWSPQI NTGAAVMIPD DYVPDLNVRL SLYRRLSEAE KAADREALAA ELIDRFGPLP PETDSLLKVV AIKGLCREAN VAKIDVGPKG AVASFRNDIY ANPLALMQFV AKNNVVWKVR PDQKVVIKGE WDTPARRLDA AEKILAQLVK LAKTA
|
| |