Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_14570 |
Symbol | mfd |
ID | 7760393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1435934 |
End bp | 1439383 |
Gene Length | 3450 bp |
Protein Length | 1149 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804355 |
Product | transcription-repair coupling factor |
Protein accession | YP_002798648 |
Protein GI | 226943575 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCGTAT TGCGTTTTCC GACCCTGCCG GCGGCCCCCG GCAAACAACA CTGGGGCAAC CTGCCGGGCA CGGCGCTTCC CCTGGCGATC GCCGAAGCCG CCAGCGCCGC GGCCAAGCGC TTCACCCTGC TCCTGACCGC CGACAGCCTG AGCGCCGAAC GCCTGGAACA GGATCTCCGC TTCTTCGCCC CGGAGCTTCC GGTGCTGCAA TTCCCGGACT GGGAAACCCT GCCCTACGAC CTCTTCTCGC CGCACCAGGA TATCGTCTCG CAGCGCATCG CCAGCCTCTA CCGCCTGCCG GAACTGAACC ACGGCGTGCT GGTGGTGCCG ATCACCACCG CCCTGCACCG TCTGGCGCCG AAGCCTTTCC TGCTGGGTAG CAGCCTGGTG CTGGACGTCG GCCAGCGGCT CGATGTGGAA CAGATGCGCC AGCGCCTGGA AGCGGCCGGC TACCGCTGTG TGGAAACCGT CTACGAACAC GGCGAATTCG CCGTGCGCGG CGCCCTGATC GATCTCTTTC CCATGGGCAG CGAGCGCCCC TACCGCATCG ACCTGTTCGA CGACGAGATC GAGACCCTGC GCACCTTCGA CCCGGAGACG CAGCGCTCGA TCGACAAGGT CGAGTCGGTA CGTCTGCTGC CGGCCCGCGA ATTCCCGCTG CGCAAGGAAG CGGTCGCCGC CTTTCGCGCC CGCTTCCGCG AGCGTTTCGA CGTCGACTTC CGCCGCTGCC CGATCTACCA GGACCTCGCC AGCGGTATCA CCCCGGCCGG CATCGAGTAC TACCTGCCGT TGTTCTTCGA GGAAACCGCC ACCCTCTTCG ACTACCTGCC GTCCACCACC CAGGTGTTCT CCCTGCCCGG CATCGAACAG GCCGCCGAGC ACTTCTGGAG CGACGTGCGC AACCGCTACG AGGAGCGCCG CCTCGATCCG GAACGGCCGC TGCTGCCGCC CGCCGAGCTG TTCCTGCCGG TGGAGGACTG TTTCGCACGC CTGAAGAGCT GGCCCCGGGC GATCGTCTCG GCCGAAGAGA TCGAACCCGG TGTCGGCCGC GAGCGCCTGC CCGCACGCGC GCTGCCGTCG CTGGCGATCG ACGCCAGGGC TGGCCAGCCG CTGGCGGCCC TGGCCGGTTT CCTCGACGGC TTCCCGGGCC GCGTGCTGTT CACCGCCGAG TCGGCCGGAC GCCGCGAAGT GCTTCTGGAA CTGCTCGCAC GCCTGAACCT GCGCCCGCAG AGCGTCGACG GCTGGCCGGG CTTTCTCGCC GCCACGGAGC GCCTGGCCAT CGCCATCGCC CCGCTGGACG AGGGCCTGCT GCTCGACGAG CCCGCCCTGG CATTGATCGC CGAGAGCCAA TTGTTCGGTC AGCGGGTGAT GCAGCGCCGG CGCCGGGAAA AGGCGCGCGA CGGCGGCGAC AACGTCATCA AGCACCTCAC CGAACTGCGC GAGGGTGCGC CTGTGGTGCA TATCGACCAC GGCATCGGCC GCTACCGCGG GCTGGTCACC CTGGAGGTCG AGGGCCAGGC CGCCGAGTTC CTCATGCTGG AGTACGCCGA GGAAGCCAAG CTCTACGTAC CGGTGGCCAA CCTGCATCTG ATCGCCCGCT ACACCGGCAG CGACGACGCC CTCGCCCCGC TGCACCGGCT GGGCTCGGAG CAATGGCAGA GGGCCAAGCG CAAGGCCGCC GAGCAGGTCC GCGACGTCGC CGCCGAGCTG CTCGACATCT ATGCCCGGCG CGCCGCCCGC GCGGGCTTCG CCTTCAAGGA CCCGGCCACC GACTACGAAA CCTTCAGCGC CAGCTTCCCC TTCGAGGAAA CCCCCGACCA GCAGGCCGCC ATCGACGCGG TGCGCACCGA CATGCTGGCG CCCCGGCCGA TGGATCGCCT GATCTGCGGC GACGTCGGCT TCGGCAAGAC CGAAGTGGCC ATGCGCGCCG CCTTCATCGC CGTGCATAGC GGCAGGCAGG TCGCGGTGCT GGTGCCGACC ACCCTGCTCG CCCAGCAGCA CTACAACAGC TTTCGCGACC GCTTCGCCGA CTGGCCGGTG AAGGTCGAGG TGATGAGCCG CTTCAAGTCG GCCAAGGAAG TCGAAGGCGC CATCGCCGCA CTGGCCGAAG GCAAGATCGA CATCGTCATC GGCACCCACA AGCTCCTGCA GGACGACGTC AGCTTCAAGA ACCTGGGCCT CGCCATCATC GACGAGGAAC ACCGCTTCGG CGTGCGCCAG AAGGAGCAGC TCAAGGCCTT GCGCAGCGAG GTGGACATCC TCACCCTGAC CGCCACGCCG ATCCCGCGCA CCCTGAACAT GGCCGTCGCC GGCATGCGCG ATCTGTCGAT CATCGCCACC CCGCCGGCGC GCCGACTGTC GGTACGCACC TTCGTCATGG AGCAGCAGAA CACCCTGATC CGCGAGGCGC TGCTGCGCGA GCTGCTGCGC GGCGGCCAGG TTTACTACCT GCACAACGAG GTGAAGACCA TCGAGAAATG CGCCCGCGAA CTGGCCGAAC TGGTGCCCGA GGCACGCATC GCCATCGGTC ACGGGCAGAT GCGCGAGCGC GAACTGGAAC AGGTGATGGG CGACTTCTAC CACAAGCGCT TCAACGTACT GGTGGCCTCG ACCATCATCG AGACCGGCAT CGACGTGCCC AGCGCCAACA CCATCGTCAT CGAACGCGCC GACAAGTTCG GCCTGGCCCA GTTGCACCAG TTGCGCGGCC GGGTCGGTCG CAGCCACCAC CAGGCCTATG CCTACCTGCT CACCCCGCCG CGCAAGCAGA TGACCGAGGA CGCGCAGAAG CGCCTGGAAG CCATCGCCGG CGCCCAGGAC CTCGGCGCCG GCTTCGTCCT GGCCACCCAC GACCTGGAGA TCCGCGGCGC CGGCGAACTG CTCGGCGAGG GCCAGAGCGG CCAGATCCAG GCGGTCGGCT TCACCCTCTA CATGGAAATG CTCGAGCGCG CGGTCAAGGC CATCCGCAAG GGCGAGCAGC CGAACCTCGA GCAGCCGCTG GGCGGCGGCC CGGACATCAA CCTGCGGGTT CCGGCGCTGA TCCCCGAAGA CTACCTGCCG GACGTGCACG CCCGCCTGAT CCTCTACAAG CGCATCGCCT CGGCGAGCGC AGAGGAAGAA CTCAATGAAC TGCAGGTGGA GATGATCGAC CGCTTCGGTC TCCTGCCCGA GCCGACCAAG CACCTGATGC GCCTGACCCG CCTGAAGCTG CAGGCGGAAC GGCTCGGCAT CGCCAAGATC GATGCCGGCC CCCAGGGCGG ACGCATCGAA TTCGCCGCCC ACACCTGCGT CGATCCGTTG GTGCTGATCA AGCTGATCCA GAGCCAGCCG AGCCGCTACA AGTTCGAAGG CGCCACCCTG CTCAGGTTCC AGATACCCAT GGAACGCCCG GAAGAACGCT TCAACACCCT CGAGGCGCTG CTCGAGCGCC TGACTCCGCC ATCCGCGTAA
|
Protein sequence | MSVLRFPTLP AAPGKQHWGN LPGTALPLAI AEAASAAAKR FTLLLTADSL SAERLEQDLR FFAPELPVLQ FPDWETLPYD LFSPHQDIVS QRIASLYRLP ELNHGVLVVP ITTALHRLAP KPFLLGSSLV LDVGQRLDVE QMRQRLEAAG YRCVETVYEH GEFAVRGALI DLFPMGSERP YRIDLFDDEI ETLRTFDPET QRSIDKVESV RLLPAREFPL RKEAVAAFRA RFRERFDVDF RRCPIYQDLA SGITPAGIEY YLPLFFEETA TLFDYLPSTT QVFSLPGIEQ AAEHFWSDVR NRYEERRLDP ERPLLPPAEL FLPVEDCFAR LKSWPRAIVS AEEIEPGVGR ERLPARALPS LAIDARAGQP LAALAGFLDG FPGRVLFTAE SAGRREVLLE LLARLNLRPQ SVDGWPGFLA ATERLAIAIA PLDEGLLLDE PALALIAESQ LFGQRVMQRR RREKARDGGD NVIKHLTELR EGAPVVHIDH GIGRYRGLVT LEVEGQAAEF LMLEYAEEAK LYVPVANLHL IARYTGSDDA LAPLHRLGSE QWQRAKRKAA EQVRDVAAEL LDIYARRAAR AGFAFKDPAT DYETFSASFP FEETPDQQAA IDAVRTDMLA PRPMDRLICG DVGFGKTEVA MRAAFIAVHS GRQVAVLVPT TLLAQQHYNS FRDRFADWPV KVEVMSRFKS AKEVEGAIAA LAEGKIDIVI GTHKLLQDDV SFKNLGLAII DEEHRFGVRQ KEQLKALRSE VDILTLTATP IPRTLNMAVA GMRDLSIIAT PPARRLSVRT FVMEQQNTLI REALLRELLR GGQVYYLHNE VKTIEKCARE LAELVPEARI AIGHGQMRER ELEQVMGDFY HKRFNVLVAS TIIETGIDVP SANTIVIERA DKFGLAQLHQ LRGRVGRSHH QAYAYLLTPP RKQMTEDAQK RLEAIAGAQD LGAGFVLATH DLEIRGAGEL LGEGQSGQIQ AVGFTLYMEM LERAVKAIRK GEQPNLEQPL GGGPDINLRV PALIPEDYLP DVHARLILYK RIASASAEEE LNELQVEMID RFGLLPEPTK HLMRLTRLKL QAERLGIAKI DAGPQGGRIE FAAHTCVDPL VLIKLIQSQP SRYKFEGATL LRFQIPMERP EERFNTLEAL LERLTPPSA
|
| |