Gene BURPS668_A2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2118 
Symbol 
ID4888655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2051578 
End bp2052714 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content63% 
IMG OID640132055 
Producthemagglutinin domain-containing protein 
Protein accessionYP_001063112 
Protein GI126445307 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0929674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTTCTAT ACATCCGTAT GAAATATCAC CGTTTTCCCC GCTCTCATGC TCAACAAGAC 
ACCGGGCGAG CCGCATCGAC CGTTCCATTT CAGCGCTTCG CGCATCTACT ATGTTCGTCC
ATCGCTCCGC TGGCCCTCGG CTTTTCCACG GATGCGCTCG CTATCGAACA GGCTGAAAGT
ACGGCGTTTA ACGCGGTGAT CGATCAGATA AAAAAAGGTG ACTTTAAGTT GAAACCAGTT
GGGGACCGCA CGCTACCAAA CAAAGTCCCG CCACCGCCAC CGCCACCGCC GTCGACGACG
ACGCCACCGC CGCCACCGCC ACCGCCGCCG CCGCCGCCGT CGACGACGCC ACCACCGCCA
CCGCCGCCGT CGACGACGCC ATCGCCACCG CCACCGACGA CGACGCCACC GACGAGGACG
ACGCCATCGA CGACGACGCC GACACCATCG ATGCACCCGA TACAGCCGAC ACAACTGCCG
TCGATTCCTA ACGCGACACC AACCTCAGGA TCCGCGACAA ACGTCACCAT CAACTTCAAT
TCGACCGGTG CCTCAGCAAT GGGCACGAAC TCTATCGCCC TTGACTTCCA TGCACGCGCT
AAGGACAGCG ATTCGCTCGC GAGCGGACGG CTCGCTCATG CGAGCGGCCC CCGGTCAACC
GCGATCGGTG CCGAAGCAAA TGCGTCCGGT CAAAACACTG TCGCGCTCGG CGCTGGCTCC
ATAGCGGATC GTAACAACAC GGTATCCGTC GGTCGTCACG GTGACGAACG ACAAATAGTG
CACGTCGCAG CCGGCACGCA AGCCACCGAT GCCGTGAATG TCGGTCAGTT GAACCTCGCA
ATGTCGAACG CCAACGCGTA CACGAACCAG CGCATCGGCG ATCTTCAGCA GAGCATCACC
GACACCGCGC GCGACGCATA TTCCGGCGTC GCCGCCGCGA CCGCGCTGAC GATGATTCCC
GATGTCGACC GCGACAAGAG GGTGTCGATC GGCGTCGGCG GCGCGGTCTA CAAGGGCCAT
CGCGCCGTCG CGCTCGGCGG CACCGCGCGC ATCAACGAAA ACCTCAAGGT GCGGGCGGGC
GTCGCGATGA GCGCGGGCGG CAATGCCGTG GGCATCGGCA TGAGCTGGCA ATGGTAA
 
Protein sequence
MVLYIRMKYH RFPRSHAQQD TGRAASTVPF QRFAHLLCSS IAPLALGFST DALAIEQAES 
TAFNAVIDQI KKGDFKLKPV GDRTLPNKVP PPPPPPPSTT TPPPPPPPPP PPPSTTPPPP
PPPSTTPSPP PPTTTPPTRT TPSTTTPTPS MHPIQPTQLP SIPNATPTSG SATNVTINFN
STGASAMGTN SIALDFHARA KDSDSLASGR LAHASGPRST AIGAEANASG QNTVALGAGS
IADRNNTVSV GRHGDERQIV HVAAGTQATD AVNVGQLNLA MSNANAYTNQ RIGDLQQSIT
DTARDAYSGV AAATALTMIP DVDRDKRVSI GVGGAVYKGH RAVALGGTAR INENLKVRAG
VAMSAGGNAV GIGMSWQW