Gene Bcep18194_A4376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4376 
Symbol 
ID3749575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1331983 
End bp1334916 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content69% 
IMG OID637762665 
Producthaemagglutinin and invasin like cell surface protein 
Protein accessionYP_368616 
Protein GI78065847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCA CTTTCCGTAC GATCTGGAAC GACGCACTGG GCGCCTGGGT CGCCGCCTCC 
GAAACCACCC GCGCCAACGG CAAGAAGACC TCGAGCAAGG TGTCGGCGTC CGCTCGGGTG
GCCGCCACCC TCTCCGTCGT CGCCGCCGCC GTCGCACCGC TCGACGCCGC GGCGCAAGCG
GCGCAACCCG GCACGACGTC CTACTACAGC GTGAACGACG GCGGCACCGC CGGCGCGAAC
GTCAACAACG ACGGCGCAAC GGGCCTGAAC GCACTCGCAG CCGGCGTCAA CGCGGCGGCA
GCGGGCGATG CCGACATTGC GATCGGTTCG GGTGCGACCA GCAGCAGCGG CTCGACGGGC
GCCGGCAACA TCGCGATCGG CCAGGATGCG CAGGCCCTGA CGCCTGGCGG CGGCTATGGC
GCAACCGCGC TCGGCGCGGG CGCGAAGGCC GGCACCGGCG GCTATACCGG TGCCACTGCA
GTCGGCTACA ACAGCACGGC CACCGAAAAC AGCACGGCAA TCGGCGCGAG CGCGATCTCG
AGCCAGACCG GCACGGCGCT CGGCCGCGCC GCGGAAGCGA CCGGGCGCGG CGCGCTCGGC
GCCGGATCGT CCGCCGCCGC GACCGCGACG AGCGCGATCG CACTGGGCGA CCGGGCGACG
AGTTCGGCCA ACACCAGCGT GTCGGTCGGC GCGCAGGCCA CCGCCAGCGC GCAGGCCGCG
AGTGCGATCG GGCCGCGTGC CGTCGCATCG GGCGCCGGCG CGGTCGCGCT CGGCGCGAGC
GCAACGGCCG CCCACGCGGG TTCGGTCGCG CTGGGCTCCG GCGCCGTGAC TGACGCAGCC
GTCGGTACGA GCGGTGCGAC GATCAACGGC ACCGCCTACG ACTTCGCCGG CATTGCGCCC
GCCTCGACCG TCAGCGTCGG CGCGGCAGGT CGTGAACGCA CGATCACCAA CGTCGCGGCC
GGCCGACTTG CGGAAAAAAG CACCGATGCC GTCAACGGCT CCCAGTTGAA CGCAACCAAC
CAGGCAGTCG CGGCGGTCGG CTCCAGCGTC ACGAGCCTGT CGACGTCGAC ATCGACGGGC
CTCTCCACCA CAAACGACAC GCTCGTGTCG TTGTCGACGT CGACCGCCGA CAGCCTGCGC
GTCGTCGACA GCAACATGGC GTCGCTATCG ACCGGCCTGA GCACGACCGA CAACACCGTC
GCGTCGCTGT CGACCGCGAC GTCGGCCGGT TTGTCGACGA CCAACAACAC ACTGGTTTCG
CTATCCACGT CCGCATCGAC CGGCATCGAC ACGCTCGGCC AGAACCTGAC GTCGTTGTCG
ACCGCAACAT CGACCACCGC CGGTTCCTTG TCCACCACGA TCAGTACAAC GAATGACAAT
CTCGTCTCGC TGTCGACCTC GACGTCGACC GGATTCGTGT CGCTGTCGAC CGGCCTGAGC
ACGACCAACG ACACTGTCAC GTCGTTGTCG ACCTCCACGT CGACGGGGCT GTCGACGACG
AACAGCACGC TCGCGTCGTT GTCGACGTCC GCATCGACGG GCATCAACTC GCTGTCGACC
AGCCTCAGCG CGACGGACAG CGCGATGACG TCGCTGTCCA CGTCGACGTC GTCGGGCCTC
GGCTCGCTGT CGACCAGCAT CAGCTCGGTC ACCGTCAACA CGACGAACCT CGGCACCAGC
ACCGCCGATG CGCTCGGCGG CGGCGCGACG TACGATCCGG CGACCGGCAA GATCACCGCG
CCATCGTACG TGACGTACAA CAACGACGGC ACGACGACGA CCAACGGCAA CGTGGGCGAC
GCCATCGACA ACCTCAATGC GAAGGGCGCG AAGTACTTCC ACGCGAACTC GACCGAAGCC
GACAGTCAGG CCACCGGCGC GAACAGCGTC GCAATCGGCC CGAACGCGAT CGCCAATATC
GACAACTCGG TGGCGATCGG CCATCGCTCG GTCACCGGTG CCGCCGTCGG CGTATCGTCG
TCGACCATCG GCGACCTGCA TTTCGGCGGC TATGCGGGCG CCAACCCGTT CGGCGTGTTC
AGCGTCGGCG CACCCGGCCA GGAACGCCAG ATCCAGAACG TCGCGGCCGG CCGCGTGAGC
GCCGACAGCA CCGACGCAAT CAACGGCAGC CAGCTGCACG CGACCAACCT GAACGTCGCA
TCGCTGTCGA CCGGGCTCAG CTCGACGAAC AGCAACCTCG CATCGCTGTC CACGTCGACG
TCGACGAGCA TCGGCTCGCT GTCGACCGGC CTGTCATCGA CCAACGAAGC GCTGGGCTCG
CTGTCGACTT CCACGTCGAC GAGCGTGACG TCGCTGTCCA CCGGCCTGTC GACGACCAAC
GACCGCGTGT CGTCGCTGTC GACCAGCGTG ACCAACATCA ACACGCAGAT CAACAACCTG
TCGACGTCGG CATCGCGCAA CACCGGCATC ACCGCGGACA TGAACGGCTC GGGCACCGAT
GCGCCGACCG TCACGGCCGG CTCCAACTCG GTCGCGATCG GCGCGAAATC GGACGACGGC
GGCCGCTCGA ACGTGGTGTC GGTCGGCAGT GCGGAGCAGC AGCGCCAGAT CGTCAACGTC
GCACCGGGCA CGCAGGGCAC CGACGCGGTC AACGTGAATC AGCTCACGCT GGCGACCGAA
TCGGCGAACC GCTACACCGA CCAGCGGGTC GGCGCGATTC AGCAAGGCGT GAACGACCTC
GCGCGCAATG CATATTCGGG CATCGCGATC GCCGGCGCAC TCGCGGGCAT GCCTCAGGTC
GATCCGGGCA AGGTGATCTC GGTCGGCGCC GGGTTCGGCA ACTACGGCGG CTACACGGCG
ATCGCGGTCG GCGGCAGCGC GCGGATCGCG CAGAACACCG TGATCAAGCT GGGGGTCGGC
ACGGTCAACG GCTCGCGCAT GATGGTCAAC GGCGGCATCG GCCATTCGTG GTGA
 
Protein sequence
MNRTFRTIWN DALGAWVAAS ETTRANGKKT SSKVSASARV AATLSVVAAA VAPLDAAAQA 
AQPGTTSYYS VNDGGTAGAN VNNDGATGLN ALAAGVNAAA AGDADIAIGS GATSSSGSTG
AGNIAIGQDA QALTPGGGYG ATALGAGAKA GTGGYTGATA VGYNSTATEN STAIGASAIS
SQTGTALGRA AEATGRGALG AGSSAAATAT SAIALGDRAT SSANTSVSVG AQATASAQAA
SAIGPRAVAS GAGAVALGAS ATAAHAGSVA LGSGAVTDAA VGTSGATING TAYDFAGIAP
ASTVSVGAAG RERTITNVAA GRLAEKSTDA VNGSQLNATN QAVAAVGSSV TSLSTSTSTG
LSTTNDTLVS LSTSTADSLR VVDSNMASLS TGLSTTDNTV ASLSTATSAG LSTTNNTLVS
LSTSASTGID TLGQNLTSLS TATSTTAGSL STTISTTNDN LVSLSTSTST GFVSLSTGLS
TTNDTVTSLS TSTSTGLSTT NSTLASLSTS ASTGINSLST SLSATDSAMT SLSTSTSSGL
GSLSTSISSV TVNTTNLGTS TADALGGGAT YDPATGKITA PSYVTYNNDG TTTTNGNVGD
AIDNLNAKGA KYFHANSTEA DSQATGANSV AIGPNAIANI DNSVAIGHRS VTGAAVGVSS
STIGDLHFGG YAGANPFGVF SVGAPGQERQ IQNVAAGRVS ADSTDAINGS QLHATNLNVA
SLSTGLSSTN SNLASLSTST STSIGSLSTG LSSTNEALGS LSTSTSTSVT SLSTGLSTTN
DRVSSLSTSV TNINTQINNL STSASRNTGI TADMNGSGTD APTVTAGSNS VAIGAKSDDG
GRSNVVSVGS AEQQRQIVNV APGTQGTDAV NVNQLTLATE SANRYTDQRV GAIQQGVNDL
ARNAYSGIAI AGALAGMPQV DPGKVISVGA GFGNYGGYTA IAVGGSARIA QNTVIKLGVG
TVNGSRMMVN GGIGHSW