Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1696 |
Symbol | |
ID | 6375383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1836353 |
End bp | 1839676 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642684190 |
Product | transcription-repair coupling factor |
Protein accession | YP_001960096 |
Protein GI | 189500626 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000188307 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCATAATT CACATGAAAC CCCTTCAAAC CGGGACGGAG AAAAACCGGA GCCGAACACA GAAATAACCG AAAGGCAAGC CTCCTACCTT CTCAGCGCCC TCAGACAGCA TCCAGCCTAT GCCGGCATTA AAACGCAAAT GTTCTCATCG TGCGAGCGCA ATACCAGCAA ACCATGCTCT GAAATAACCG GAATTTCAGG CATTCAGGGC TCTCTGGCCA CGCTGCTTGC CGTGCAACTG TTTACTGATC TCGGCATCCC GGTCCTGGTA CTGGCCTCGC AGAGCATATT CGACCTCTAT GATAACGATC TTTCCGAACT GCTTCAACCG GAACAGGTGC ATACCGCCTC AGACGAACTC TCTGCTGCGC TGGGCAAACT CTCCAGCGAT CAAAAAATCG TCCTTCTCGC TTCATTTGAT GATCTGCTCA CTCCGCTGCA CTGCCCTGAA GAGCAGGATT CCATGCTCTT TACTCTACAG ATTGATAAAG AGGCCGGATA CGGCAAACTG AAAAACTTTC TTCTGCAAAA CGACTATAGC AAAAGCGAAT TCGTTGAAGA AGAGGGAGAG TATTCAACAC GTGGATCAAT TATCGATATC TTCCCTTTCG GATCAAAAGC GCCCGTTCGA ATAGAATTCT TCGGAGATAC GGTTACCTCG CTACGCTCCT TTGACACCGG CAGTCAGCTT TCAGGCGCCC GTCTTGACAG GGCGTTACTG GGCGGCAACT TTGTTGCCGA CAAGAAAAGC GACTCTGAGA AATCATGGAA TCTGCTCGAC CATCTTCCCG CCGAAACGTT ACTCATTATC GATGACATAA CCTCACTCAG CGCTCATGAA AACAGAGCTG CTTTTGAGGA CAAACTGATG CGGTTCCGAT GCCTGGGCAT TAACCGGATT GAAAAACAGT CCATTGATTT CCATGCGGAA GCCCAGACAA AATTGCATGC AAATTTCAAA TTGTTCGCAA GGGAACTCAC CCGGATACAG TCCGGCAAAC AGCACACTGT TTTTGTAGGC AACTCAAAAA AAGAGATCGA GGAACTTACA GAATTCATTG CTGAAGAAAT AGACACTACC GGCCTGAAGA AACATATCGA AGTACACTGG GTCCCATTGA ACCTGCACTC CGGTTTTGTC TTCAGCGGCT TTAACGTCTA CACTGAATCA GACATATTCG GCAAGCTGCA TACCCATAAG TCTCATAAAA AAAGAGCCTT TAAAGGCATC TCTCTCAAAG ACCTCCAGAA GCTGAATATT GGAGATTATA TCGTTCATGA AGATTACGGG ATCGGGATTT TCAAAGCACT TGAAACCATA GAGGTCGGGG ATTCAGAGCA GGAAAGCGTC CTGGTCGAGT ATCAGGGGGG TGATCAGCTT TACGTAAATG TCCAGAACAT CCGGCTCCTT TCAAAATACA CCGCATCTGA AGGCTCCCTG CCGGCACTTT CAAAACTTGG AAGCTCAAAG TGGAAAGCAA AAAAAGAGAA AGTAAAAAAT CGCCTGAAGG ATATCGCGTC GAAACTCATC AGACTCTACG CGAAAAGAAA AATGACCACA GGCTTTGCCT TCAGGACGGA TTCCATATTC CAGAAAGAAT TCGAAGCATC GTTCATCTTC GACGAAACCC CTGATCAGTT GAAAACGATT GAACAGGTTA AAAACGATAT GCAGTCAGGC TCACCGATGG ACCGTCTGAT CTGTGGAGAT GCCGGTTTCG GTAAAACCGA AATCGCGATG CGCGCTTCGT TCAAGGCTGT AGAATCAAAT AAACAGGTCG CGCTCCTTAC ACCGACAACC ATACTTGCTC ACCAGCATGC AGAAACATTC AGACGCAGAT TTGAAAACGT TCCTCTGAAC ATAGCCGTAC TCAGCCGTTT TGTTACAAAA AAAGAGCAGA AAAATCTTAT CGAAAAGATC TCACGCGGCC TGATAGATAT AGTCATCGGC ACACACAGGC TTGTCTCAAA AGACGTCGTC TTTCAGGATC TCGGGCTGCT CATTATTGAT GAAGAACAGC ATTTCGGTGT TTCCACAAAA GAAAAGCTGC GTCAGAGCTT TCCAGGTGTA GATACGCTGA CCATGTCGGC AACTCCGATA CCGAGAACAT TGCAGTTTTC AATGCTTGGC GCAAGAGATC TGTCCATTGT ATCAACTCCA CCGAAAAACC GTCAGCCTGT TGAAACAGTC ATCACGCAGT TCAATCCGGA CACCATTCGT TCCGCCATAC AGCATGAAAT AAGCCGTCGG GGCCAGGTTT TTTTTCTCCA TAACCGCATC GCCGGTCTTG ACCAGATCCG GAACACCCTG CAGGAACTCG CGCCAGAAGC CAGAATCGGC ATTGCTCACG GACAGATGCC GACATCCGAA CTCGAGAACG TGATGATGGA GTTCATCAGC CATAAACTCG ATCTCCTCAT TTCAACATCG ATCATCGGAT CAGGTCTTGA TATATCCAAC GCAAATACCA TTATCATCAA TCGTGCGGAT ATGTTCGGCC TTTCAGATCT CTATCAGCTC AGGGGCCGTG TCGGCAGAAG CGAACGCAAA GCATACAGCT ATCTCATCAT CCCGCCGGTT CACACGCTCA AAAGAGAAGC GATGGAAAGA CTGGCCGTCA TCGAAAGCTT TACCGAACTT GGTTCCGGCT TCAATATCGC GCTACGTGAC CTCGATATCA GAGGAGCGGG AAACCTCCTG GGAGCGGAAC AGTCTGGAGC CATTCATGAA CTGGGATTCG ACCTCTACCA GAAACTGCTT GAAGAAGCTG TCACCGAACT CAAGACAGGA GTCTTCAGGT CACTCTTTAC CGACAAAGAG CCCCAGACCA GCCAGACGGC AGGAATCACT GACATGATTT TTTATTTTGA CGCTCTGATT CCTGATTATT ACGTCTCTGC AACCCAGGAA CGGTTTTCCT TCTATGAAAA AATATCAAAA AGCACGACAG ACAGCCAGAT AGAAGCTATT GAAAAAGAAT TACGCGACCG GTTCGGCCCG GTTCCCCGTG ATGTAGCAAA CCTGATAGGC CTTGCAAAGC TTAAAAACCT CTGTTCATCC ATCGGTCTTG TCAAAGTCGA CATACAGCAA GACAAATTCA CACTGTACCT TCCCGATGAC GAAAACACTG AATTCTACAC TCAGCCTTTC TTCCAGGAAC TGATCGCATG TGTACAGTCC GAATGGATGC AGAAATATCG ACCTCGCTTT CAGGAAGGGA AAAAGATGAA GCTGATTATC CATCTTCCCT CAAAGACATC TACTGAACCG TCGGTCATCA TGGAACTTTA CAGCTCCCTG CTCAACAAAC TCAAAGGGGA GTAA
|
Protein sequence | MHNSHETPSN RDGEKPEPNT EITERQASYL LSALRQHPAY AGIKTQMFSS CERNTSKPCS EITGISGIQG SLATLLAVQL FTDLGIPVLV LASQSIFDLY DNDLSELLQP EQVHTASDEL SAALGKLSSD QKIVLLASFD DLLTPLHCPE EQDSMLFTLQ IDKEAGYGKL KNFLLQNDYS KSEFVEEEGE YSTRGSIIDI FPFGSKAPVR IEFFGDTVTS LRSFDTGSQL SGARLDRALL GGNFVADKKS DSEKSWNLLD HLPAETLLII DDITSLSAHE NRAAFEDKLM RFRCLGINRI EKQSIDFHAE AQTKLHANFK LFARELTRIQ SGKQHTVFVG NSKKEIEELT EFIAEEIDTT GLKKHIEVHW VPLNLHSGFV FSGFNVYTES DIFGKLHTHK SHKKRAFKGI SLKDLQKLNI GDYIVHEDYG IGIFKALETI EVGDSEQESV LVEYQGGDQL YVNVQNIRLL SKYTASEGSL PALSKLGSSK WKAKKEKVKN RLKDIASKLI RLYAKRKMTT GFAFRTDSIF QKEFEASFIF DETPDQLKTI EQVKNDMQSG SPMDRLICGD AGFGKTEIAM RASFKAVESN KQVALLTPTT ILAHQHAETF RRRFENVPLN IAVLSRFVTK KEQKNLIEKI SRGLIDIVIG THRLVSKDVV FQDLGLLIID EEQHFGVSTK EKLRQSFPGV DTLTMSATPI PRTLQFSMLG ARDLSIVSTP PKNRQPVETV ITQFNPDTIR SAIQHEISRR GQVFFLHNRI AGLDQIRNTL QELAPEARIG IAHGQMPTSE LENVMMEFIS HKLDLLISTS IIGSGLDISN ANTIIINRAD MFGLSDLYQL RGRVGRSERK AYSYLIIPPV HTLKREAMER LAVIESFTEL GSGFNIALRD LDIRGAGNLL GAEQSGAIHE LGFDLYQKLL EEAVTELKTG VFRSLFTDKE PQTSQTAGIT DMIFYFDALI PDYYVSATQE RFSFYEKISK STTDSQIEAI EKELRDRFGP VPRDVANLIG LAKLKNLCSS IGLVKVDIQQ DKFTLYLPDD ENTEFYTQPF FQELIACVQS EWMQKYRPRF QEGKKMKLII HLPSKTSTEP SVIMELYSSL LNKLKGE
|
| |