Gene BTH_II0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II0071 
Symbol 
ID3846772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp71434 
End bp73059 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content67% 
IMG OID637837377 
Producthemagglutinin-related protein 
Protein accessionYP_438273 
Protein GI83717342 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGTCG CGGGCGCCGG CCTGAATGCG TCGAACATCG ATCAGGTCGA CCTGATCGCA 
CGTGCGGTGC AGATGAACGC AGCGGTCTAC GCGAAGAACC TGAACGTGAT CACAGGCGCG
AGCCAGGTCA ACCGCGACAC GCTCGCCGCG ACGCCGATCG CCGGCGAAGG TCCTGCTCCG
GCAGTCGCGA TCGACGTCGG CCAACTGGGC GGGATGTATA GCAATCGAAT CTTCCTCGCA
TCGAATGAGA ACGGTGTGGG GGTGGCGAAT GCCGGCACGA TCGCTGCGCA GGCGGGCGAC
CTGACACTGC AGGTGAATGG CCGACTCGTC CTCACCGGCA GGACGACTGC GAGCGGCAAC
CTTGCGTTGT CGGCCGCGGG CGGAATTCAG AATAGCGGCA CGACGTACGC GCAGCAATCG
CTGTCGGCCA GCACGAGCGC CGATCTCACG AACAGCGGGA CGCTCGCGGC GCAGCAGAAT
ACGACGGTGA ACGCGGGCAG CGTCAATTCG ACGGGCACGC TCGGCGCGGG CGTGAACAAC
GACGGCTCGG TGGCGCGCAG TGGTGAACTG AAGCTGACGG CCTCGGGCCA ACTGAGCGCA
ATCGGCCGGA ATGTCGCGGG CGGCAACGCG TCGCTGACGG GCGGCAGCGT GAACCTCGCC
GGCAGCCAGA CGGCCGCGAA CGGCAATCTG TCGCTGAACG CGACGAGCGG CGACATGAAC
CTGTCGAACG CGACGACAAG CGCCCAAGGG GCCGTGACTG CGAATGCGAC GGGAACGGTG
ATCAACGATC GCGGCAATCT GTCGAGCGGC GCAGGCACAA CGCTTGCCGT CGGCAGCCTC
TCGAACCAGG GCGGCAAGGT GTCGTCGCAG GGGGCGCTGT CGGTGACGGC CGCCGGCCAA
ATCGCCAATC AGTCCGGCGA ATTGGTGTCC CAGAGCACGA TGAACATGCA TGGCGGCACC
CTCGCGAACA ACCAGGGCAC CATTCAAAGC GCGGCGGGCA TGACGGTGGC CGGGGTGTCG
GTGGACAACA CGGCGGGCCG AATCACGTCA CTCAATGGCG ATGGCCTGTC GATCACGGCG
ACTGGCCAAC TAACCAATGC AGCCGGCACG ACGGCGAACG GCGCGCAAGG CGGCGTCATC
GGCGGCAACG GCGCCGTCAC CGTGCAGGGC GGCAACGTCG CCAACCACGG AAGGATCACG
TCCAATGCGA ACCTGCGCGT CTCGGGCCAG TCGGTCGACA ACGGCCGAGG CACGCTGCAG
GCCGCGCAAA ACGTCGCGGT GGATGCGGGT GCGCGACTGA CGAACGACGG CGGCTCGATT
GTCGGCCAGA CCGCGGCGCT CAGCGGAACG ACGCTCGACA ACCGTGCCGG CACCGTGCAG
GCCGGTCAAC TGTCGTTGAA CGCGACCGAC CTCGCGAACC ATGCCGGCAC GATCACGCAG
ACCGGCACCG GCGCGATGGC CGTCAATGTG TCGAGCACGC TCGACAACTC CGGCGGCGGC
ACGCTGCAAA CCAACAGTAC CGACCTGACG CTCGCCCCCG CTTCGCTGAT TAACGACGGC
GGCACGATCA CCCATGCCGG TAACGGCACG CTTACGCGGA CCTGCAGACG ATGTACCAAT
CGCTGA
 
Protein sequence
MTVAGAGLNA SNIDQVDLIA RAVQMNAAVY AKNLNVITGA SQVNRDTLAA TPIAGEGPAP 
AVAIDVGQLG GMYSNRIFLA SNENGVGVAN AGTIAAQAGD LTLQVNGRLV LTGRTTASGN
LALSAAGGIQ NSGTTYAQQS LSASTSADLT NSGTLAAQQN TTVNAGSVNS TGTLGAGVNN
DGSVARSGEL KLTASGQLSA IGRNVAGGNA SLTGGSVNLA GSQTAANGNL SLNATSGDMN
LSNATTSAQG AVTANATGTV INDRGNLSSG AGTTLAVGSL SNQGGKVSSQ GALSVTAAGQ
IANQSGELVS QSTMNMHGGT LANNQGTIQS AAGMTVAGVS VDNTAGRITS LNGDGLSITA
TGQLTNAAGT TANGAQGGVI GGNGAVTVQG GNVANHGRIT SNANLRVSGQ SVDNGRGTLQ
AAQNVAVDAG ARLTNDGGSI VGQTAALSGT TLDNRAGTVQ AGQLSLNATD LANHAGTITQ
TGTGAMAVNV SSTLDNSGGG TLQTNSTDLT LAPASLINDG GTITHAGNGT LTRTCRRCTN
R