Gene Avi_4347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_4347 
SymbolmgpS 
ID7387927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3664054 
End bp3667197 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content61% 
IMG OID643652997 
ProductATP-dependent helicase 
Protein accessionYP_002551168 
Protein GI222150211 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.755222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCTGA GCGGCCGCAG CGTGATTGCT GTGCTGGGGC CGACCAATAC GGGCAAGACC 
CATTATGCCA TCGAGCGCAT GGTTGCCCAT GGGTCAGGTA TGATCGGCCT GCCTTTACGG
CTGCTGGCGC GCGAAGTTTA CACCAGGCTG GTCGAGCGGG TCGGCGCGGC GCATGTCGCC
CTCATCACGG GCGAGGAAAA GATCACCCCG CCCAATACAC GGTTTTCTGT CTGCACCGTC
GAAGCGCTGC CGCGCGAGAC CAAAGTCGCG TTCGTGGCCA TTGATGAAAT CCAGTTGGCG
GGCGATCTCG AGCGCGGCCA TATTTTTACC GACCGGCTTT TGCATCTGCG CGGACGCGAG
GAAACCCTGT TGCTAGGCTC GGCGACCATG AAGCCGATCC TTCAGCATCT TCTGCCGGGT
ATTACTGTCG TTGAGCGTCC ACGCCTGTCG CAGCTGTTTT ATGCTGGCGA AAAGAAGATC
ACCCGGCTGC CGCAGCGCAC GGCAATCGTC GCCTTCTCCG CCGACGAGGT GTATTCCATT
GCCGAATTGA TCCGTCGCCA GCGCGGCGGG GCGGCTGTGG TGCTCGGCGC ACTCAGCCCG
CGCACCCGCA ATGCCCAGGT CGGCCTCTAT CAGTCCGGCG ATGTCGAATA TCTTGTGGCG
ACCGATGCGA TCGGCATGGG CCTGAACCTC GATGTCGATC ATGTTGCCTT TGCCCAGGAC
CGCAAGTTCG ATGGCTATCA GTTCCGCAAT CTCAATCCCG GAGAACTGGG CCAGATTGCC
GGGCGCGCCG GACGCCATCT GAAAGATGGC ACGTTCGGCG TCACAGGACG GGTCGATCCC
TTTGAGCCGG AACTGGTGGA ACGGTTGCAA AGCCATCATT TCGATCCGGT CAAGGTGCTG
CAATGGCGCA CCGCCCAATT CGATTTTTCC TCGATTGCCA ACCTGCGCCG CAGCCTCGAT
GCCGCCCCTA AGGTGGCGGG ACTGGTGCGG GCGCTGCCAG CAATCGATCA GCAGGCGCTA
GATTATCTGG CCCGCTATCC GGAAGTCCAG GACCTTGCTT CGTCGCCAAA TCGCGTCTCG
CTGCTGTGGG ATGCCTGCGC ATTACCGGAT TATCGGCGAA TTACCCCGGC ACAACATGCC
GATCTGATTC TTACCCTCTA TTCCGACCTT GCCCGGCACG GTAGTGTGAA CGAAGATTTC
ATGGCAGAGC AGGTGCATCG CTCCGACCGG ACAGACGGCG AGATAGATAC TCTTTCTGCG
CGCATTGCGC AGATCCGAAC TTGGACGTAT GTGTCGAACC GGCCAGGTTG GCTTGCCGAT
CCGACACACT GGCAAGAAAA GACGCGGGAA ATCGAAGATC GATTGTCTGA CGCGTTACAT
GAAAGGTTGA CGAAACGCTT TGTTGATCGC AGGACATCTG TGCTTATGAA GCGCTTGAGA
GAGAATGGGA TGCTGGAAGC TGAAATCAGT GTGAACGGTG ATGTCTTCGT TGAGGGACAT
CACGTGGGCC AACTCTCCGG ATTCCGGTTT ACGCCGATCT CTGGAACCGA GGGGCCGGAC
GCCAAGGCGG TGCAAACTGC TGCCCAAAAG GCGCTTGGCC TGGAATTCGA AGCGCGGGCT
GCAAGATTGC ATGCTTCGGG CAATGGGGAT CTGGCGCTCA GTTCCGACGG TCTCGTCCGG
TGGCTGGGTG ATCCCGTCGC CCGGCTGACC GGCTCGGATC ACATCATGCG CCCCCGCATT
CTGCTGCTGG CGGACGAACC GCTGAGCGGC AATGCGCGCG AGCACGTCGT CGCCCGCATC
GAGCGCTTCG TCAATCATCA TATCTCGACC ATTCTGAAGC CACTCGACGA TCTGTCGCGG
GCCGAGGACT TGCAGGGCCT CTCCAAGGGA TTGGCCTTCC AGCTGGTGGA AGGGCTTGGA
ATTCTGTTCC GCCGCGACGT CACCGAAGAG GTCAAGTCGC TGGACCAGGA TGCACGCGCC
TCCATGCGCC GCTACGGCGT GCGCTTCGGC GCCTATCACA TCTTCATGCC AGCCCTGTTG
AAGCCAGCAC CGGCCGAGCT GATCACCCTG CTCTGGGCCT TGAAGAACGA TGGCCTGGAC
AAGCCAGGCT ATGGCGATCT CATTCCGGTT CTGGCCGCAG GCCGCACCTC AGTGGTGACT
GATCCGAGCT TTGAGCGCAA TTTCTATAAG CTGGCGGGCT TCCGCTTCCT GGGCAAGCGG
GCCGTGCGCA TCGATATTCT GGAGCGGTTG GCGGATCTGA TCCGTCCATT GTTGCAATGG
AAGCCGGGCA CCACGCCGCG CCCGGACGGG GCTTACGATG GCCGCCGCTT CACCACTACC
ACGGCCATGC TCTCCATCCT CGGTGCGACA CCCGATGACA TGGAAGAAAT CCTGAAAGGC
CTCGGCTACC GCGCCGATGC GGTCAAGGCA GAAGAGGCCC AGGCCTTCCT TGGCACCCAG
GATGACAAGC CTGCCGCAGC AGCTGCCAAT GCACCAACGC AGACAGCCGA AAGCGTCGAA
AAAACCGACG AAGACGATGC GGATCAGCAT GGTGAAGCCT CAGCGACGGA AACTGTCGCT
GCTGAACCGG CAGTGCTCGA GACCGTTCCA AATGCTGAGA TCAGCGCCGA AGCAACGACA
GAAGCATCGG CGGAGGTCAA AGAGGCCGAA GTTGCAGCGA CCGAGGCTCC ATCCGCTGAC
CTTCTTGCCA ATCCGGAAGG ACCAACCGAA CCCAAGCCGG TTCTGCTGTG GCGTCCTGGT
GGACGCAACG ACAACCAGCG CCAGGCCGGA CGTCCACAAC AGGGCGACCG CCGTCCGCAG
GGTGCAAAGC GCGCCCCGCA GCAGGGCGAG CAGAAGCCGG ACGCTCGTAC CGAAGGTGCC
CGTACAGAAG GTGGACGTGA CCGCGACGAC AAGCGCCGCG AGGGTGGCCA TCACGGCAAG
CCCCGCGACC GGGACGGCAA CCGCGACAAG CACGCAAACC GGGGCGACCG CGATGATCGC
GGTCCGCGTA AAGACCGCGA CAGGGACACA AAATCAACCC AGCCGCAGCG TTTCGAGGCA
AAGCCGCCAC GCAAGGAAAA GCCGATCGAT CCGGATTCTC CTTTCGCCAA ACTGGCTGCC
TTGAAGGAGC AGATGAAGAA GTAA
 
Protein sequence
MILSGRSVIA VLGPTNTGKT HYAIERMVAH GSGMIGLPLR LLAREVYTRL VERVGAAHVA 
LITGEEKITP PNTRFSVCTV EALPRETKVA FVAIDEIQLA GDLERGHIFT DRLLHLRGRE
ETLLLGSATM KPILQHLLPG ITVVERPRLS QLFYAGEKKI TRLPQRTAIV AFSADEVYSI
AELIRRQRGG AAVVLGALSP RTRNAQVGLY QSGDVEYLVA TDAIGMGLNL DVDHVAFAQD
RKFDGYQFRN LNPGELGQIA GRAGRHLKDG TFGVTGRVDP FEPELVERLQ SHHFDPVKVL
QWRTAQFDFS SIANLRRSLD AAPKVAGLVR ALPAIDQQAL DYLARYPEVQ DLASSPNRVS
LLWDACALPD YRRITPAQHA DLILTLYSDL ARHGSVNEDF MAEQVHRSDR TDGEIDTLSA
RIAQIRTWTY VSNRPGWLAD PTHWQEKTRE IEDRLSDALH ERLTKRFVDR RTSVLMKRLR
ENGMLEAEIS VNGDVFVEGH HVGQLSGFRF TPISGTEGPD AKAVQTAAQK ALGLEFEARA
ARLHASGNGD LALSSDGLVR WLGDPVARLT GSDHIMRPRI LLLADEPLSG NAREHVVARI
ERFVNHHIST ILKPLDDLSR AEDLQGLSKG LAFQLVEGLG ILFRRDVTEE VKSLDQDARA
SMRRYGVRFG AYHIFMPALL KPAPAELITL LWALKNDGLD KPGYGDLIPV LAAGRTSVVT
DPSFERNFYK LAGFRFLGKR AVRIDILERL ADLIRPLLQW KPGTTPRPDG AYDGRRFTTT
TAMLSILGAT PDDMEEILKG LGYRADAVKA EEAQAFLGTQ DDKPAAAAAN APTQTAESVE
KTDEDDADQH GEASATETVA AEPAVLETVP NAEISAEATT EASAEVKEAE VAATEAPSAD
LLANPEGPTE PKPVLLWRPG GRNDNQRQAG RPQQGDRRPQ GAKRAPQQGE QKPDARTEGA
RTEGGRDRDD KRREGGHHGK PRDRDGNRDK HANRGDRDDR GPRKDRDRDT KSTQPQRFEA
KPPRKEKPID PDSPFAKLAA LKEQMKK