Gene Information |
Locus tag | BTH_II0071 |
Symbol | |
ID | 3846772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 71434 |
End bp | 73059 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637837377 |
Product | hemagglutinin-related protein |
Protein accession | YP_438273 |
Protein GI | 83717342 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) |
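The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes 1-based, inclusive coordinates (the record does not state its convention) and a single terminal stop codon; the function names `gene_length` and `protein_length` are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for BTH_II0071.
length = gene_length(71434, 73059)
print(length, protein_length(length))  # prints "1626 541"
```

Under these assumptions the result agrees with the listed Gene Length (1626 bp) and Protein Length (541 aa).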
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACGGTCG CGGGCGCCGG CCTGAATGCG TCGAACATCG ATCAGGTCGA CCTGATCGCA CGTGCGGTGC AGATGAACGC AGCGGTCTAC GCGAAGAACC TGAACGTGAT CACAGGCGCG AGCCAGGTCA ACCGCGACAC GCTCGCCGCG ACGCCGATCG CCGGCGAAGG TCCTGCTCCG GCAGTCGCGA TCGACGTCGG CCAACTGGGC GGGATGTATA GCAATCGAAT CTTCCTCGCA TCGAATGAGA ACGGTGTGGG GGTGGCGAAT GCCGGCACGA TCGCTGCGCA GGCGGGCGAC CTGACACTGC AGGTGAATGG CCGACTCGTC CTCACCGGCA GGACGACTGC GAGCGGCAAC CTTGCGTTGT CGGCCGCGGG CGGAATTCAG AATAGCGGCA CGACGTACGC GCAGCAATCG CTGTCGGCCA GCACGAGCGC CGATCTCACG AACAGCGGGA CGCTCGCGGC GCAGCAGAAT ACGACGGTGA ACGCGGGCAG CGTCAATTCG ACGGGCACGC TCGGCGCGGG CGTGAACAAC GACGGCTCGG TGGCGCGCAG TGGTGAACTG AAGCTGACGG CCTCGGGCCA ACTGAGCGCA ATCGGCCGGA ATGTCGCGGG CGGCAACGCG TCGCTGACGG GCGGCAGCGT GAACCTCGCC GGCAGCCAGA CGGCCGCGAA CGGCAATCTG TCGCTGAACG CGACGAGCGG CGACATGAAC CTGTCGAACG CGACGACAAG CGCCCAAGGG GCCGTGACTG CGAATGCGAC GGGAACGGTG ATCAACGATC GCGGCAATCT GTCGAGCGGC GCAGGCACAA CGCTTGCCGT CGGCAGCCTC TCGAACCAGG GCGGCAAGGT GTCGTCGCAG GGGGCGCTGT CGGTGACGGC CGCCGGCCAA ATCGCCAATC AGTCCGGCGA ATTGGTGTCC CAGAGCACGA TGAACATGCA TGGCGGCACC CTCGCGAACA ACCAGGGCAC CATTCAAAGC GCGGCGGGCA TGACGGTGGC CGGGGTGTCG GTGGACAACA CGGCGGGCCG AATCACGTCA CTCAATGGCG ATGGCCTGTC GATCACGGCG ACTGGCCAAC TAACCAATGC AGCCGGCACG ACGGCGAACG GCGCGCAAGG CGGCGTCATC GGCGGCAACG GCGCCGTCAC CGTGCAGGGC GGCAACGTCG CCAACCACGG AAGGATCACG TCCAATGCGA ACCTGCGCGT CTCGGGCCAG TCGGTCGACA ACGGCCGAGG CACGCTGCAG GCCGCGCAAA ACGTCGCGGT GGATGCGGGT GCGCGACTGA CGAACGACGG CGGCTCGATT GTCGGCCAGA CCGCGGCGCT CAGCGGAACG ACGCTCGACA ACCGTGCCGG CACCGTGCAG GCCGGTCAAC TGTCGTTGAA CGCGACCGAC CTCGCGAACC ATGCCGGCAC GATCACGCAG ACCGGCACCG GCGCGATGGC CGTCAATGTG TCGAGCACGC TCGACAACTC CGGCGGCGGC ACGCTGCAAA CCAACAGTAC CGACCTGACG CTCGCCCCCG CTTCGCTGAT TAACGACGGC GGCACGATCA CCCATGCCGG TAACGGCACG CTTACGCGGA CCTGCAGACG ATGTACCAAT CGCTGA
Protein sequence | MTVAGAGLNA SNIDQVDLIA RAVQMNAAVY AKNLNVITGA SQVNRDTLAA TPIAGEGPAP AVAIDVGQLG GMYSNRIFLA SNENGVGVAN AGTIAAQAGD LTLQVNGRLV LTGRTTASGN LALSAAGGIQ NSGTTYAQQS LSASTSADLT NSGTLAAQQN TTVNAGSVNS TGTLGAGVNN DGSVARSGEL KLTASGQLSA IGRNVAGGNA SLTGGSVNLA GSQTAANGNL SLNATSGDMN LSNATTSAQG AVTANATGTV INDRGNLSSG AGTTLAVGSL SNQGGKVSSQ GALSVTAAGQ IANQSGELVS QSTMNMHGGT LANNQGTIQS AAGMTVAGVS VDNTAGRITS LNGDGLSITA TGQLTNAAGT TANGAQGGVI GGNGAVTVQG GNVANHGRIT SNANLRVSGQ SVDNGRGTLQ AAQNVAVDAG ARLTNDGGSI VGQTAALSGT TLDNRAGTVQ AGQLSLNATD LANHAGTITQ TGTGAMAVNV SSTLDNSGGG TLQTNSTDLT LAPASLINDG GTITHAGNGT LTRTCRRCTN R
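Both sequences are printed in blocks of ten characters, so the whitespace has to be stripped before any computation such as GC content. The following is a minimal Python sketch, not part of the original record. The helper names `clean_sequence` and `gc_content` are illustrative, and the sample string is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the whole-gene value listed in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence, used only as a sample.
sample = "GTGACGGTCG CGGGCGCCGG CCTGAATGCG"
print(len(clean_sequence(sample)), round(gc_content(sample), 3))
```

Passing the full Gene sequence string to `gc_content` would give the fraction for the entire gene, which can then be compared with the GC content field.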