Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A4376 |
Symbol | |
ID | 3749575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 1331983 |
End bp | 1334916 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637762665 |
Product | haemagglutinin and invasin like cell surface protein |
Protein accession | YP_368616 |
Protein GI | 78065847 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCA CTTTCCGTAC GATCTGGAAC GACGCACTGG GCGCCTGGGT CGCCGCCTCC GAAACCACCC GCGCCAACGG CAAGAAGACC TCGAGCAAGG TGTCGGCGTC CGCTCGGGTG GCCGCCACCC TCTCCGTCGT CGCCGCCGCC GTCGCACCGC TCGACGCCGC GGCGCAAGCG GCGCAACCCG GCACGACGTC CTACTACAGC GTGAACGACG GCGGCACCGC CGGCGCGAAC GTCAACAACG ACGGCGCAAC GGGCCTGAAC GCACTCGCAG CCGGCGTCAA CGCGGCGGCA GCGGGCGATG CCGACATTGC GATCGGTTCG GGTGCGACCA GCAGCAGCGG CTCGACGGGC GCCGGCAACA TCGCGATCGG CCAGGATGCG CAGGCCCTGA CGCCTGGCGG CGGCTATGGC GCAACCGCGC TCGGCGCGGG CGCGAAGGCC GGCACCGGCG GCTATACCGG TGCCACTGCA GTCGGCTACA ACAGCACGGC CACCGAAAAC AGCACGGCAA TCGGCGCGAG CGCGATCTCG AGCCAGACCG GCACGGCGCT CGGCCGCGCC GCGGAAGCGA CCGGGCGCGG CGCGCTCGGC GCCGGATCGT CCGCCGCCGC GACCGCGACG AGCGCGATCG CACTGGGCGA CCGGGCGACG AGTTCGGCCA ACACCAGCGT GTCGGTCGGC GCGCAGGCCA CCGCCAGCGC GCAGGCCGCG AGTGCGATCG GGCCGCGTGC CGTCGCATCG GGCGCCGGCG CGGTCGCGCT CGGCGCGAGC GCAACGGCCG CCCACGCGGG TTCGGTCGCG CTGGGCTCCG GCGCCGTGAC TGACGCAGCC GTCGGTACGA GCGGTGCGAC GATCAACGGC ACCGCCTACG ACTTCGCCGG CATTGCGCCC GCCTCGACCG TCAGCGTCGG CGCGGCAGGT CGTGAACGCA CGATCACCAA CGTCGCGGCC GGCCGACTTG CGGAAAAAAG CACCGATGCC GTCAACGGCT CCCAGTTGAA CGCAACCAAC CAGGCAGTCG CGGCGGTCGG CTCCAGCGTC ACGAGCCTGT CGACGTCGAC ATCGACGGGC CTCTCCACCA CAAACGACAC GCTCGTGTCG TTGTCGACGT CGACCGCCGA CAGCCTGCGC GTCGTCGACA GCAACATGGC GTCGCTATCG ACCGGCCTGA GCACGACCGA CAACACCGTC GCGTCGCTGT CGACCGCGAC GTCGGCCGGT TTGTCGACGA CCAACAACAC ACTGGTTTCG CTATCCACGT CCGCATCGAC CGGCATCGAC ACGCTCGGCC AGAACCTGAC GTCGTTGTCG ACCGCAACAT CGACCACCGC CGGTTCCTTG TCCACCACGA TCAGTACAAC GAATGACAAT CTCGTCTCGC TGTCGACCTC GACGTCGACC GGATTCGTGT CGCTGTCGAC CGGCCTGAGC ACGACCAACG ACACTGTCAC GTCGTTGTCG ACCTCCACGT CGACGGGGCT GTCGACGACG AACAGCACGC TCGCGTCGTT GTCGACGTCC GCATCGACGG GCATCAACTC GCTGTCGACC AGCCTCAGCG CGACGGACAG CGCGATGACG TCGCTGTCCA CGTCGACGTC GTCGGGCCTC GGCTCGCTGT CGACCAGCAT CAGCTCGGTC ACCGTCAACA CGACGAACCT CGGCACCAGC ACCGCCGATG CGCTCGGCGG CGGCGCGACG TACGATCCGG CGACCGGCAA GATCACCGCG CCATCGTACG TGACGTACAA CAACGACGGC ACGACGACGA CCAACGGCAA CGTGGGCGAC GCCATCGACA ACCTCAATGC GAAGGGCGCG AAGTACTTCC ACGCGAACTC GACCGAAGCC GACAGTCAGG CCACCGGCGC GAACAGCGTC GCAATCGGCC CGAACGCGAT CGCCAATATC GACAACTCGG TGGCGATCGG CCATCGCTCG GTCACCGGTG CCGCCGTCGG CGTATCGTCG TCGACCATCG GCGACCTGCA TTTCGGCGGC TATGCGGGCG CCAACCCGTT CGGCGTGTTC AGCGTCGGCG CACCCGGCCA GGAACGCCAG ATCCAGAACG TCGCGGCCGG CCGCGTGAGC GCCGACAGCA CCGACGCAAT CAACGGCAGC CAGCTGCACG CGACCAACCT GAACGTCGCA TCGCTGTCGA CCGGGCTCAG CTCGACGAAC AGCAACCTCG CATCGCTGTC CACGTCGACG TCGACGAGCA TCGGCTCGCT GTCGACCGGC CTGTCATCGA CCAACGAAGC GCTGGGCTCG CTGTCGACTT CCACGTCGAC GAGCGTGACG TCGCTGTCCA CCGGCCTGTC GACGACCAAC GACCGCGTGT CGTCGCTGTC GACCAGCGTG ACCAACATCA ACACGCAGAT CAACAACCTG TCGACGTCGG CATCGCGCAA CACCGGCATC ACCGCGGACA TGAACGGCTC GGGCACCGAT GCGCCGACCG TCACGGCCGG CTCCAACTCG GTCGCGATCG GCGCGAAATC GGACGACGGC GGCCGCTCGA ACGTGGTGTC GGTCGGCAGT GCGGAGCAGC AGCGCCAGAT CGTCAACGTC GCACCGGGCA CGCAGGGCAC CGACGCGGTC AACGTGAATC AGCTCACGCT GGCGACCGAA TCGGCGAACC GCTACACCGA CCAGCGGGTC GGCGCGATTC AGCAAGGCGT GAACGACCTC GCGCGCAATG CATATTCGGG CATCGCGATC GCCGGCGCAC TCGCGGGCAT GCCTCAGGTC GATCCGGGCA AGGTGATCTC GGTCGGCGCC GGGTTCGGCA ACTACGGCGG CTACACGGCG ATCGCGGTCG GCGGCAGCGC GCGGATCGCG CAGAACACCG TGATCAAGCT GGGGGTCGGC ACGGTCAACG GCTCGCGCAT GATGGTCAAC GGCGGCATCG GCCATTCGTG GTGA
|
Protein sequence | MNRTFRTIWN DALGAWVAAS ETTRANGKKT SSKVSASARV AATLSVVAAA VAPLDAAAQA AQPGTTSYYS VNDGGTAGAN VNNDGATGLN ALAAGVNAAA AGDADIAIGS GATSSSGSTG AGNIAIGQDA QALTPGGGYG ATALGAGAKA GTGGYTGATA VGYNSTATEN STAIGASAIS SQTGTALGRA AEATGRGALG AGSSAAATAT SAIALGDRAT SSANTSVSVG AQATASAQAA SAIGPRAVAS GAGAVALGAS ATAAHAGSVA LGSGAVTDAA VGTSGATING TAYDFAGIAP ASTVSVGAAG RERTITNVAA GRLAEKSTDA VNGSQLNATN QAVAAVGSSV TSLSTSTSTG LSTTNDTLVS LSTSTADSLR VVDSNMASLS TGLSTTDNTV ASLSTATSAG LSTTNNTLVS LSTSASTGID TLGQNLTSLS TATSTTAGSL STTISTTNDN LVSLSTSTST GFVSLSTGLS TTNDTVTSLS TSTSTGLSTT NSTLASLSTS ASTGINSLST SLSATDSAMT SLSTSTSSGL GSLSTSISSV TVNTTNLGTS TADALGGGAT YDPATGKITA PSYVTYNNDG TTTTNGNVGD AIDNLNAKGA KYFHANSTEA DSQATGANSV AIGPNAIANI DNSVAIGHRS VTGAAVGVSS STIGDLHFGG YAGANPFGVF SVGAPGQERQ IQNVAAGRVS ADSTDAINGS QLHATNLNVA SLSTGLSSTN SNLASLSTST STSIGSLSTG LSSTNEALGS LSTSTSTSVT SLSTGLSTTN DRVSSLSTSV TNINTQINNL STSASRNTGI TADMNGSGTD APTVTAGSNS VAIGAKSDDG GRSNVVSVGS AEQQRQIVNV APGTQGTDAV NVNQLTLATE SANRYTDQRV GAIQQGVNDL ARNAYSGIAI AGALAGMPQV DPGKVISVGA GFGNYGGYTA IAVGGSARIA QNTVIKLGVG TVNGSRMMVN GGIGHSW
|
| |