Gene Bcep18194_C7609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C7609 
Symbol 
ID3734574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp1247197 
End bp1250040 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content70% 
IMG OID637761310 
ProductRhs element Vgr protein 
Protein accessionYP_367297 
Protein GI78060722 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.823228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGA ATATGGACGT TCTGAAAAAA CTCGCCGGAG ACTGGTCGCA GTACGACCGG 
TTTCTCTGGG TCACCACGCC GCTGGGCGCG AACGCGCTGG TCGCCGAGAG TCTGCACGGG
TGGGAGGCGG TCGATCACGG CGGCTACCGG TTTCAGTTGA CGGCATTGTC CGGGAACGCG
GCGTTGCCGC TGGAGCGCCT GATCGGCGCG CCGATCCTGG TCGAATGGCG TGCGCGTGAA
GGCAGCGACG TCCGCCGCCC CGTTCACGGT CACATCATTG CGGCGGAGCG GATCGGCTAC
AACGGCGGGC TGGCCCGCTT TCGTCTCGAA GTCGAGCCGT GGCTCGCGGT GCTCCGCCAG
CGCGTCGATC ACTACAATTT CCAGAACGCC AGCGTGCTCG ACATCAGCGA GCAGATTTTC
GGCTATCACA CCGCGGGCAC CGTGGCACCC GCGTGGCGCT GGGCGCTGGC CGATCCCGCC
AGATACCGCA AGCGCAGCCT GACCACCCAG GCCGGTGAAT CCGACTTCGA TTTCCTGCAG
CGGCTGTGGG CCGAAGAGGG CATCTTCTAC TGGTTCGAGC ATGCCGGCGA CGCCCATACG
TCCAGCCTCG GCAAGCACAC GCTCGTGCTC GCCGATTCGA ACCAGCACTT CACGCCGGAC
AAGCCGGAGG TCATCGGCTT TCACCAGACG AGCGACGGCG ATCCGGCGGG CAGCATCCAG
CATTTCATGA CCGCGCGTCG CTGGCGGATC GGCCGGGTCG CACGCGCGAG CTGGGATCAC
CGCACGCTGT CGAACCGGTT GACCGGCGCG CAGGCCGACG GCGTCGCGAT GGCTGGCGAA
GATCGTGACG TTGCCGGCCC GTATGCGTTC CAGACCGCCG CACTCGGCGA TCAGCGCGCG
CGCCAGCAAC TCGACGCGCA ACGCGTCGCC GCGCTGTGCA GCGAAGGGCG CAGCACCTGC
ATGGCGCTGC ATCCGGGAAT GCGTTTCGCG ATCAGCGGGC ATCCGGCGCT GAAGGCGTCG
GACGCGTTCG TGTGCCTGCG CGTTCAGCAC AGCGTGCGCG CGAACGTCGA TGCGGACGTG
CACGCCGCGA TCGAGCAGAC GCTCGGCGAC ATACCGCCGA TGCTGGATAC CGACGTGAAC
GAGTACGGCG TCGCCCATGC GCTGCACGCC GCGCTCGGCA GCGGGACGCA CGATGGCGCG
CCGCTGGTGG GCAACGAAGC GGTCTATGCG AACAGCTTCG TCGCGTTGCC GGCCGGACAG
ACGTACCGGC CGCTGAGCGA AGGCGGTCAC GGCGCACGCT CGCATCCCGT GGTGGCCGTA
CAGGGTGCGC AAACGGCGAT CGTCGTCGGC GCGGGCGATC CGGTTCACAC CGATCGCGAT
CACCGGGTCC GGATCCAGCA CCATGCGCAG CGCGGGCAGA AGGCCGCGAG TCTCGAGGAT
CATCCGCACG CGGCCAACGC GCCGGCCGAC CGGAATGCCG GCACATGGAC ACGCGTGCTG
ACGCCGGTCG GCGGCGACAA CTGGGGCGGC GTGACCGTGC CGCGCGTCGG GCAGGAAGTG
TGGACCGACT GGCTCGAGGG GCAACCGGAC CGGCCGGTGG TGGTGGGTTC GCTGTACAAC
GGCCAGGGCA ATGCCGATGC GCAGCACAAC GCGCAAGCGG GCGGTCCGGC CAGCAGTACC
GGCAATGCGG CCGCGTGGTT CGCAGGCAAT GGCCACCCCG CGGCACTGAC GGGCATCAAG
ACGCAGGACC TGAACCTGAG CCAGCAAGGG ACGGGCGGCT ATCGGCAGTT CCTGCTCGAC
GACACGGCCG GCGAGGCGAG TGCGCGTCTT TATACGACGG ACCACAACAG CGGGTTGACG
CTGGGTCATC TCAAGCAGAT TCAGGACAAC CAGCGGCAAG CCGATCGCGG GTATGGCATG
GAGTTGACGA CCGACGCTGC CGCTGCGTTG CGCGGCGGCT CGGGCATGCT CGTCAGCGCG
GCGCGTGGCG CCAGCCAGAT GGACGCGAGT GCGGCGGGCC AGGTGCTGAC CCAGAACCGT
CAACTGCTCG ACGGCCTCGC GGAGGCGGCG CGCGCGCAAG GTGCCGAGCC GGCACAGCCC
GCGGCAAGCG TCGGCGCGGG GGCGGATGAC GCGGGGCATG CGACGGGGCC CGGCACGTCG
ACGCAATCGG CCGCCGTGAC CGGGTTGCAG CAGAGCGAGC AGGCGCTGGC CGAAAAGCGC
GACGGCCGAA CGTCTGCCCA GGCCGGCGGC GGAAGCGGTA GCGCTGCCGC ATGGTCGCGT
CCGCATGTCG TCGCGCATGG CGCGGACGGT CTCGCCGCCG TGACGCCCGG CAGCCAGGTC
TGGGTGTCGG GGACGGAGAC CGTGCTGAGC GCCGGCCAGG ATCTGCAATG GACGGCCAAG
GGCAAGACAA CCCTGGCGGC GACGCACGGC GTCGCGTTCT ATACACAGGG CAACGCCGCG
GGCGAGCGGC CCGTCGCCGG GCAAGGCATC GCGTTGCACG CGGCCTCCGG CGCGGTAAGC
CTGCAGGCGC AGAATGCGGG CACGCTCGGC GCGGCCGCGC AGCAGGCGGT CGTGCTGTCG
AGCAGCCAGG GCGCGGCGAA TCTGCAGGCA CCGAAGCGTC TGCTGCTGAA TGCCGCGAAA
GCGTACCTGA AGATGGAGGG CGGCAATATC GAGGTCGGCG CGCCGGGGCG GGTGGAATTC
AAGTCGGCGC AGCGCGAACT GACCGGGCCG CGAGGCGCGG GCGGGCAAAC CTCGTTGGGC
AGCAGCAGCG CGAAGGATTG CCAGTTGCGC CTGTCGGGCG CGGCCGCGAG CCACGACAGC
GTCGTCATGC TGCCGGCGGG CTGA
 
Protein sequence
MSTNMDVLKK LAGDWSQYDR FLWVTTPLGA NALVAESLHG WEAVDHGGYR FQLTALSGNA 
ALPLERLIGA PILVEWRARE GSDVRRPVHG HIIAAERIGY NGGLARFRLE VEPWLAVLRQ
RVDHYNFQNA SVLDISEQIF GYHTAGTVAP AWRWALADPA RYRKRSLTTQ AGESDFDFLQ
RLWAEEGIFY WFEHAGDAHT SSLGKHTLVL ADSNQHFTPD KPEVIGFHQT SDGDPAGSIQ
HFMTARRWRI GRVARASWDH RTLSNRLTGA QADGVAMAGE DRDVAGPYAF QTAALGDQRA
RQQLDAQRVA ALCSEGRSTC MALHPGMRFA ISGHPALKAS DAFVCLRVQH SVRANVDADV
HAAIEQTLGD IPPMLDTDVN EYGVAHALHA ALGSGTHDGA PLVGNEAVYA NSFVALPAGQ
TYRPLSEGGH GARSHPVVAV QGAQTAIVVG AGDPVHTDRD HRVRIQHHAQ RGQKAASLED
HPHAANAPAD RNAGTWTRVL TPVGGDNWGG VTVPRVGQEV WTDWLEGQPD RPVVVGSLYN
GQGNADAQHN AQAGGPASST GNAAAWFAGN GHPAALTGIK TQDLNLSQQG TGGYRQFLLD
DTAGEASARL YTTDHNSGLT LGHLKQIQDN QRQADRGYGM ELTTDAAAAL RGGSGMLVSA
ARGASQMDAS AAGQVLTQNR QLLDGLAEAA RAQGAEPAQP AASVGAGADD AGHATGPGTS
TQSAAVTGLQ QSEQALAEKR DGRTSAQAGG GSGSAAAWSR PHVVAHGADG LAAVTPGSQV
WVSGTETVLS AGQDLQWTAK GKTTLAATHG VAFYTQGNAA GERPVAGQGI ALHAASGAVS
LQAQNAGTLG AAAQQAVVLS SSQGAANLQA PKRLLLNAAK AYLKMEGGNI EVGAPGRVEF
KSAQRELTGP RGAGGQTSLG SSSAKDCQLR LSGAAASHDS VVMLPAG