Gene Information |
Locus tag | Bcep18194_C7612 |
Symbol | |
ID | 3734674 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | - |
Start bp | 1251227 |
End bp | 1254571 |
Gene Length | 3345 bp |
Protein Length | 1114 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637761313 |
Product | Rhs element Vgr protein |
Protein accession | YP_367300 |
Protein GI | 78060725 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria; [COG4253] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein; [TIGR03361] type VI secretion system Vgr family protein |
|
|
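The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 1251227
end_bp = 1254571
gene_length_bp = 3345
protein_length_aa = 1114


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_cds_length(protein_aa: int) -> int:
    """One codon per residue plus one stop codon (assumes an unspliced CDS)."""
    return (protein_aa + 1) * 3


# Both checks agree with the reported gene length of 3345 bp.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_cds_length(protein_length_aa) == gene_length_bp)
```

Under these assumptions the span, the reported gene length, and the protein length are mutually consistent.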
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTG CTGCTGCATT GAAGGGCTAT TCGCAGGCGA CTCGCCAGAT CCAGATCGAT ACGGCGATGC CGGGCGCTTT CGCGGTCGAG CGGTTTCATG GCCGGGAAGC GATGGACGAG TCCTTCCGGT TCGAGATCGA CGTGCTGTCG AGCACGCCGT TCCTCGACCT GAATCCGCTG CTCGGCACGG CGATCCGCCT GCGTCTCGCG ACCGGCGCGG GCGAGCGCTG CTGGAACGGG TACGTCGTGC GGGCCGCGTA CAGCGACAGC GACGGCGAGA TCACGCGCTA CCGGCTGACC ATGGCCTCCT GGCTCGAACT GCTCCGGCTG CGCCGCAACT GCCTGTATTT CGTCGGCCTC GATACCGAGG GCATCTGCGA GCGCGTGTTC GGCGACTATC CGGAGGCGCA TCGGCGCTAT GAACTGAAGG AGCCGTTGCG CACGTTCGAT CTGCGTGGCC AGTATCGCGA AACCGACTTC GACTTCGTGA TGCGCCAGTT GTCCGAGGCC GGCCTGTCGT TTCGGATCGA GCATGCACAG GACGCCGGCG AGAAACCGTC CGGCAACCAT ACCGTCGTGG TGTTCGACCG GCGCGCCGAA CCGCCCAAAG GCAGCACCGT CGCCTATAAC CGGCAGGACG TGGGCGACCC CGACGGCGTA TTGACCTATT TCACCACGCG GCACCAACTG GTCCCGGATC GCGTCACGGC CGCGAGCTGG AAAGCCAGCA ATCTCGTTGC GCTCGCCGGG CATGCCGAGG GCGAGGCCGA TCGCGATGCA CCCGCGATGC CCGCGCGCGA AGTGCTGGAC GCGCAGCGTG CCGGCCGCTT CGAGACGTCG GACCAGGCGC AGCGCTACGC GTCGCAGCGT CTGGACGCGC TGCGCCTGTC GAAGCGGATT CACTATGGCG CCGGCTCGTC GCGTACGCTG GAGATCGGCA AGGTTCACAC GCTGACCGGC TATCCGGACG GTACGGTTTC GTTCGTCCCG CTGACGCTCG AGCACGAGGC GGTCAACAAT CTCGGTGCCG ATATCGCGCA ACTGCTCGAG CACGGCGAAC TGGAGCAGGG GCTCTATCGC AACCGCTTTG CCGCGGTGCC GCCGGGCGTG CCGATCGTGC CGCCTCATCG CGACCGGCCG GTCGTGCAGG GCGTGCAGAC CGCGATCGTC GTCGGCGAGC CGTCGAACCG GGTGAGCAGC ACGCGCGACC ACCAGGTGCG CGTGCAGTTT CCGTGGATGC GCGGCACGGC GCCGCTGCCG GGCGGACTGA CCGACACCGC GAGCCGGTCG AACCCGCAGG GGCATGCGCC CGGCGACCAT CGATCCGGCG TGGTTGCGCG CATCGCCGAA CAGGCGGCGG GCCCGAACTT CGGCCACAGC TTTACGCCGC GCATCGGCGC GGAAGTCGTG GTCGGCTTCG ACTCGGGAAA CATCGACATG CCGGTCGTGC TGGGCCAGCT TTACGGCGGC CGCGTGCAGC CCCCGTTCGC GGCCGGTGAA GGCAGCAGCG CGAATCACGC GGGCGTGCTG ACCGGCATGC AGACGCAGAC ACTCGACGGC ACGGCGGGCA GCCGCTGGGT GATGGACGAT GCGTCGGGCC AGTTGCGTCA TGAACTGGGC AACAGCGTCG CGAACAGCCG GCTTGCGCAA GGCTATCTGA TCGATCAGCA GGGCGCGGTT CGCGGTGCCT ATCGTGGCGA AGGTTTTGAT CTCGCGACCG AAGGCTGGGG CGTCGTGCGG GCGGGCGACG GTGTGCTCGT GTCGGGCACC GCGCGCACGG AAGCGGCTTC GACGCAAATG 
GATCTGGGAG AAAGCGTTGC CCAGTTGAAG CAGGCGGTGA AGACCGCACA GGGTCTCGAC GAGGCGGCGG CGCGTGCAAC GGCCGGCCGG CTGACGGCGA ACGCCGCGCA GGCGGATTTC CTGAAGGCGA TCGATCCCGC GCAGGACGGC AAGTACACGG GTGCGGTGAA CGGGCAGAGC GCGACCAAAC CGGCTGCCGG CGGAACCGGC GGCAGCGGGG ATCCGGTCGA GCGCTTCGCG GTGCCGGCCG TCGTGCTGGA GTCGCCGCAA AACGTCGTGA TGAGTACCGG CAACAGTGCG GTGTCCTATG CGGAGAAGCA CGTGCACCTG ACCGCGCAGG GCGATGCGCA TCTGGCTGCC GGCGCGACGG TCGCCGGTGC GTCCGGCGAT GCGGCGAGCG TGTACGCGGC CGCAGGCGGA ATCAAGGCCG TGGCCGGCCA CGGGCCGGTC AGTGTCGAGG CGCATGCGTC GTCGATGCAG ATCCTGGCCG ACCAGTCGGT GTGCATCACG TCGTCGGATG ACCGGATCGA CGTGCTCGCG AAGGATGCGA TCGTGCTTCA GCAAGGGCCG AGCCGGATCA CGCTGAAGGG CGCGGACATC CTGGTCGAGA CGCCCGGATC GTTTGCCGTC AAGGCGGGTG CCCATCCGTT CATGGGGCCG GGTGCGCAGT CGCCGGTGCT GCCGGCCTTC CCGATTCCCG TGCCGCTCGC GCTTTACGAC GAGCAACTGC GCTTCGTCAA CGCGGACGGC GTGCCGCTGT CGAAGGTTGC CTATCAGTTG AAGCTGGCCG ACGGCACCAC GGCGTCGGGC GTGACCGATG ACGCGGGCAA GACCGAGCGC GTGGCGTCGG CGAGCCCGCT GGGGATCCTG TCGGCGCTGC TGACCCCGAC CCAGATGGTG GACTGCTGCG GACGCACGTC GGGTACGCCG CCAGCGCCGG TCGAAGTCAA GATCAAGGGC GTGCAGACGA ACCAGTTTCA GTTGGGGGAA TCGGAGAAGT CGGTCGAGGT CGACGCCCAT GAGCGGGTGC TGACCGCGGG CGAAATTGAA ATGGCCCGGA CAGTCTTCAA GGACGGTATC GACTACGACA AGGTGCGGGT GCACAAAGGG AGTTACTTCT GGTTCAACCT GCAAAACAAG AACACCGCCG TCACGCCGAA CGGGAAGATG TATTTCCTTG ACGACCTGTA CGTCGACGAT TTCTCGGCAA TGAACGGCCC GAACATCTGG AAAAGAAGCC TGTTCATGCA CGAAATGACC CATGTCTGGC AGTACCAGCT TGGGTATGCG GTGCGATGGC ATGCGCTGAC CGTCACGATT CGAGGACAGA GCGCATACGA ATACACCGTC GCGCCAGGTG CGGTATTTCA CGATTACAAC ATGGAGCAGC AGGGCAACCT GGTCGCGGAT TACTACGCCG TGCAGGTGCT GAAGGCGCCT TTCGCCGTCT TTCATCGTGG CTACGTCGGC ACGCCGTTCG AGCTCGATCA TGTCCTTGCT CCGCTCCTCG AGGATCCCAA GAATGCAGAC AATCTTCCGA AGTAG
|
Protein sequence | MDIAAALKGY SQATRQIQID TAMPGAFAVE RFHGREAMDE SFRFEIDVLS STPFLDLNPL LGTAIRLRLA TGAGERCWNG YVVRAAYSDS DGEITRYRLT MASWLELLRL RRNCLYFVGL DTEGICERVF GDYPEAHRRY ELKEPLRTFD LRGQYRETDF DFVMRQLSEA GLSFRIEHAQ DAGEKPSGNH TVVVFDRRAE PPKGSTVAYN RQDVGDPDGV LTYFTTRHQL VPDRVTAASW KASNLVALAG HAEGEADRDA PAMPAREVLD AQRAGRFETS DQAQRYASQR LDALRLSKRI HYGAGSSRTL EIGKVHTLTG YPDGTVSFVP LTLEHEAVNN LGADIAQLLE HGELEQGLYR NRFAAVPPGV PIVPPHRDRP VVQGVQTAIV VGEPSNRVSS TRDHQVRVQF PWMRGTAPLP GGLTDTASRS NPQGHAPGDH RSGVVARIAE QAAGPNFGHS FTPRIGAEVV VGFDSGNIDM PVVLGQLYGG RVQPPFAAGE GSSANHAGVL TGMQTQTLDG TAGSRWVMDD ASGQLRHELG NSVANSRLAQ GYLIDQQGAV RGAYRGEGFD LATEGWGVVR AGDGVLVSGT ARTEAASTQM DLGESVAQLK QAVKTAQGLD EAAARATAGR LTANAAQADF LKAIDPAQDG KYTGAVNGQS ATKPAAGGTG GSGDPVERFA VPAVVLESPQ NVVMSTGNSA VSYAEKHVHL TAQGDAHLAA GATVAGASGD AASVYAAAGG IKAVAGHGPV SVEAHASSMQ ILADQSVCIT SSDDRIDVLA KDAIVLQQGP SRITLKGADI LVETPGSFAV KAGAHPFMGP GAQSPVLPAF PIPVPLALYD EQLRFVNADG VPLSKVAYQL KLADGTTASG VTDDAGKTER VASASPLGIL SALLTPTQMV DCCGRTSGTP PAPVEVKIKG VQTNQFQLGE SEKSVEVDAH ERVLTAGEIE MARTVFKDGI DYDKVRVHKG SYFWFNLQNK NTAVTPNGKM YFLDDLYVDD FSAMNGPNIW KRSLFMHEMT HVWQYQLGYA VRWHALTVTI RGQSAYEYTV APGAVFHDYN MEQQGNLVAD YYAVQVLKAP FAVFHRGYVG TPFELDHVLA PLLEDPKNAD NLPK
|
| |
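The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how the protein sequence relates to the gene sequence, the code below strips the spacing, translates codon by codon, and computes GC content. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene is on the minus strand) and uses the amino-acid assignments of the standard code, which translation table 11 shares. Alternative start codons, which table 11 permits, are not handled. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGACATTG CTGCTGCATT GAAGGGCTAT"))  # MDIAAALKGY
```

Applying `translate` and `gc_percent` to the full gene sequence should allow a comparison with the Protein sequence and GC content fields of the record.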