Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_4347 |
Symbol | mgpS |
ID | 7387927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3664054 |
End bp | 3667197 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643652997 |
Product | ATP-dependent helicase |
Protein accession | YP_002551168 |
Protein GI | 222150211 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.755222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCTGA GCGGCCGCAG CGTGATTGCT GTGCTGGGGC CGACCAATAC GGGCAAGACC CATTATGCCA TCGAGCGCAT GGTTGCCCAT GGGTCAGGTA TGATCGGCCT GCCTTTACGG CTGCTGGCGC GCGAAGTTTA CACCAGGCTG GTCGAGCGGG TCGGCGCGGC GCATGTCGCC CTCATCACGG GCGAGGAAAA GATCACCCCG CCCAATACAC GGTTTTCTGT CTGCACCGTC GAAGCGCTGC CGCGCGAGAC CAAAGTCGCG TTCGTGGCCA TTGATGAAAT CCAGTTGGCG GGCGATCTCG AGCGCGGCCA TATTTTTACC GACCGGCTTT TGCATCTGCG CGGACGCGAG GAAACCCTGT TGCTAGGCTC GGCGACCATG AAGCCGATCC TTCAGCATCT TCTGCCGGGT ATTACTGTCG TTGAGCGTCC ACGCCTGTCG CAGCTGTTTT ATGCTGGCGA AAAGAAGATC ACCCGGCTGC CGCAGCGCAC GGCAATCGTC GCCTTCTCCG CCGACGAGGT GTATTCCATT GCCGAATTGA TCCGTCGCCA GCGCGGCGGG GCGGCTGTGG TGCTCGGCGC ACTCAGCCCG CGCACCCGCA ATGCCCAGGT CGGCCTCTAT CAGTCCGGCG ATGTCGAATA TCTTGTGGCG ACCGATGCGA TCGGCATGGG CCTGAACCTC GATGTCGATC ATGTTGCCTT TGCCCAGGAC CGCAAGTTCG ATGGCTATCA GTTCCGCAAT CTCAATCCCG GAGAACTGGG CCAGATTGCC GGGCGCGCCG GACGCCATCT GAAAGATGGC ACGTTCGGCG TCACAGGACG GGTCGATCCC TTTGAGCCGG AACTGGTGGA ACGGTTGCAA AGCCATCATT TCGATCCGGT CAAGGTGCTG CAATGGCGCA CCGCCCAATT CGATTTTTCC TCGATTGCCA ACCTGCGCCG CAGCCTCGAT GCCGCCCCTA AGGTGGCGGG ACTGGTGCGG GCGCTGCCAG CAATCGATCA GCAGGCGCTA GATTATCTGG CCCGCTATCC GGAAGTCCAG GACCTTGCTT CGTCGCCAAA TCGCGTCTCG CTGCTGTGGG ATGCCTGCGC ATTACCGGAT TATCGGCGAA TTACCCCGGC ACAACATGCC GATCTGATTC TTACCCTCTA TTCCGACCTT GCCCGGCACG GTAGTGTGAA CGAAGATTTC ATGGCAGAGC AGGTGCATCG CTCCGACCGG ACAGACGGCG AGATAGATAC TCTTTCTGCG CGCATTGCGC AGATCCGAAC TTGGACGTAT GTGTCGAACC GGCCAGGTTG GCTTGCCGAT CCGACACACT GGCAAGAAAA GACGCGGGAA ATCGAAGATC GATTGTCTGA CGCGTTACAT GAAAGGTTGA CGAAACGCTT TGTTGATCGC AGGACATCTG TGCTTATGAA GCGCTTGAGA GAGAATGGGA TGCTGGAAGC TGAAATCAGT GTGAACGGTG ATGTCTTCGT TGAGGGACAT CACGTGGGCC AACTCTCCGG ATTCCGGTTT ACGCCGATCT CTGGAACCGA GGGGCCGGAC GCCAAGGCGG TGCAAACTGC TGCCCAAAAG GCGCTTGGCC TGGAATTCGA AGCGCGGGCT GCAAGATTGC ATGCTTCGGG CAATGGGGAT CTGGCGCTCA GTTCCGACGG TCTCGTCCGG TGGCTGGGTG ATCCCGTCGC CCGGCTGACC GGCTCGGATC ACATCATGCG CCCCCGCATT CTGCTGCTGG CGGACGAACC GCTGAGCGGC AATGCGCGCG AGCACGTCGT CGCCCGCATC GAGCGCTTCG TCAATCATCA TATCTCGACC ATTCTGAAGC CACTCGACGA TCTGTCGCGG GCCGAGGACT TGCAGGGCCT CTCCAAGGGA TTGGCCTTCC AGCTGGTGGA AGGGCTTGGA ATTCTGTTCC GCCGCGACGT CACCGAAGAG GTCAAGTCGC TGGACCAGGA TGCACGCGCC TCCATGCGCC GCTACGGCGT GCGCTTCGGC GCCTATCACA TCTTCATGCC AGCCCTGTTG AAGCCAGCAC CGGCCGAGCT GATCACCCTG CTCTGGGCCT TGAAGAACGA TGGCCTGGAC AAGCCAGGCT ATGGCGATCT CATTCCGGTT CTGGCCGCAG GCCGCACCTC AGTGGTGACT GATCCGAGCT TTGAGCGCAA TTTCTATAAG CTGGCGGGCT TCCGCTTCCT GGGCAAGCGG GCCGTGCGCA TCGATATTCT GGAGCGGTTG GCGGATCTGA TCCGTCCATT GTTGCAATGG AAGCCGGGCA CCACGCCGCG CCCGGACGGG GCTTACGATG GCCGCCGCTT CACCACTACC ACGGCCATGC TCTCCATCCT CGGTGCGACA CCCGATGACA TGGAAGAAAT CCTGAAAGGC CTCGGCTACC GCGCCGATGC GGTCAAGGCA GAAGAGGCCC AGGCCTTCCT TGGCACCCAG GATGACAAGC CTGCCGCAGC AGCTGCCAAT GCACCAACGC AGACAGCCGA AAGCGTCGAA AAAACCGACG AAGACGATGC GGATCAGCAT GGTGAAGCCT CAGCGACGGA AACTGTCGCT GCTGAACCGG CAGTGCTCGA GACCGTTCCA AATGCTGAGA TCAGCGCCGA AGCAACGACA GAAGCATCGG CGGAGGTCAA AGAGGCCGAA GTTGCAGCGA CCGAGGCTCC ATCCGCTGAC CTTCTTGCCA ATCCGGAAGG ACCAACCGAA CCCAAGCCGG TTCTGCTGTG GCGTCCTGGT GGACGCAACG ACAACCAGCG CCAGGCCGGA CGTCCACAAC AGGGCGACCG CCGTCCGCAG GGTGCAAAGC GCGCCCCGCA GCAGGGCGAG CAGAAGCCGG ACGCTCGTAC CGAAGGTGCC CGTACAGAAG GTGGACGTGA CCGCGACGAC AAGCGCCGCG AGGGTGGCCA TCACGGCAAG CCCCGCGACC GGGACGGCAA CCGCGACAAG CACGCAAACC GGGGCGACCG CGATGATCGC GGTCCGCGTA AAGACCGCGA CAGGGACACA AAATCAACCC AGCCGCAGCG TTTCGAGGCA AAGCCGCCAC GCAAGGAAAA GCCGATCGAT CCGGATTCTC CTTTCGCCAA ACTGGCTGCC TTGAAGGAGC AGATGAAGAA GTAA
|
Protein sequence | MILSGRSVIA VLGPTNTGKT HYAIERMVAH GSGMIGLPLR LLAREVYTRL VERVGAAHVA LITGEEKITP PNTRFSVCTV EALPRETKVA FVAIDEIQLA GDLERGHIFT DRLLHLRGRE ETLLLGSATM KPILQHLLPG ITVVERPRLS QLFYAGEKKI TRLPQRTAIV AFSADEVYSI AELIRRQRGG AAVVLGALSP RTRNAQVGLY QSGDVEYLVA TDAIGMGLNL DVDHVAFAQD RKFDGYQFRN LNPGELGQIA GRAGRHLKDG TFGVTGRVDP FEPELVERLQ SHHFDPVKVL QWRTAQFDFS SIANLRRSLD AAPKVAGLVR ALPAIDQQAL DYLARYPEVQ DLASSPNRVS LLWDACALPD YRRITPAQHA DLILTLYSDL ARHGSVNEDF MAEQVHRSDR TDGEIDTLSA RIAQIRTWTY VSNRPGWLAD PTHWQEKTRE IEDRLSDALH ERLTKRFVDR RTSVLMKRLR ENGMLEAEIS VNGDVFVEGH HVGQLSGFRF TPISGTEGPD AKAVQTAAQK ALGLEFEARA ARLHASGNGD LALSSDGLVR WLGDPVARLT GSDHIMRPRI LLLADEPLSG NAREHVVARI ERFVNHHIST ILKPLDDLSR AEDLQGLSKG LAFQLVEGLG ILFRRDVTEE VKSLDQDARA SMRRYGVRFG AYHIFMPALL KPAPAELITL LWALKNDGLD KPGYGDLIPV LAAGRTSVVT DPSFERNFYK LAGFRFLGKR AVRIDILERL ADLIRPLLQW KPGTTPRPDG AYDGRRFTTT TAMLSILGAT PDDMEEILKG LGYRADAVKA EEAQAFLGTQ DDKPAAAAAN APTQTAESVE KTDEDDADQH GEASATETVA AEPAVLETVP NAEISAEATT EASAEVKEAE VAATEAPSAD LLANPEGPTE PKPVLLWRPG GRNDNQRQAG RPQQGDRRPQ GAKRAPQQGE QKPDARTEGA RTEGGRDRDD KRREGGHHGK PRDRDGNRDK HANRGDRDDR GPRKDRDRDT KSTQPQRFEA KPPRKEKPID PDSPFAKLAA LKEQMKK
|
| |