Gene BURPS1106A_2087 details

Gene Information

Locus tag: BURPS1106A_2087
Symbol:
ID: 4901478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2081211
End bp: 2082182
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 66%
IMG OID: 640135317
Product: major fimbrial subunit protein
Protein accession: YP_001066352
Protein GI: 126452242
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3539] P pilus assembly protein, pilin FimA
TIGRFAM ID:
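
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 2081211
end_bp = 2082182

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 972 323
```

Both results match the Gene Length and Protein Length fields.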


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGATTGA AACGCGAACG GCGTGCGCGG CGCTCCCGCC TGCCCGCGGC GGCAGGCGGC 
CTGTGGCTTG TCGTTGCCGC GCTGCCGCCC GGCGCACGGG CAGCGACTTG CGAAGGCGAC
AAGACACTCG TCACGCTTCC GGCCATCGCG GTCGCGGCCG ACGCGCCGGT GGGAACGGTG
CTGTGGCGTC AGAAGGGGAT CGCTTTCAGC ACCTATTGCA CGTTGGGATG GTTCGACACC
AGCAACATTT ATGTCTGGCG CGCCGACCTG CGCTCGGCGC TGCAGCCATA TGGGCTGACG
TTCTGGCTGA CTTACGGAGG GCAGGGCGGC AACACCGCGC TGCAAATCAA GGAGCCGATG
GTCGTCGATC TCGGCGGAAA GGCCGGCTAT GCGAGCGGCT CCGTCGACCT GGAACTGAGG
AAGACGGGCG TGACGCCGGC GCAAGGCGTC GTCGGCGCGG CGGACATCCC CGCGTTCTAT
CTCGATAGCA ATACGAACTA CAACAAAGGC TCGCACTACA TCCGCGGGCT GACCAACATT
TCGTTCGTCT CCTATACCTG CGACATCGAT ACGGGGTCGC GCAGCATGAA CGTGCCGCTC
GGCGACGTGC GCGTCGATCG CTTCAGCGGC ATCGGCTCCA CCTTCGCGGA TCGGAATTTC
GGCATCGGCA TGACGTGCAC GCAGCCGGCC GGCACGTACG ATATCGCGCT GACGTTTTCC
GCGACGGCGG ACAGCTCCGG CGCACCGGGC GTGCTCGCGA TTACGCAAGG GGCGTCTTCC
GCGTCCGGAG TCGGCATTCA GTTGCTGATG AACGGCTCGC CGGTGACTTT CGGCGCCGTC
CTCGACGCGG GCAGCGCGAC CGCGGGCGCG ACGCTGACGA TCCCGATGAC GGCACGCTAT
TATCAGACCG GCAGTGTCGT GACGCCGGGC GCGGCGAACG GGATCGCGAC GTTCGCCGTC
AGCTACAAGT GA
 
Protein sequence
MRLKRERRAR RSRLPAAAGG LWLVVAALPP GARAATCEGD KTLVTLPAIA VAADAPVGTV 
LWRQKGIAFS TYCTLGWFDT SNIYVWRADL RSALQPYGLT FWLTYGGQGG NTALQIKEPM
VVDLGGKAGY ASGSVDLELR KTGVTPAQGV VGAADIPAFY LDSNTNYNKG SHYIRGLTNI
SFVSYTCDID TGSRSMNVPL GDVRVDRFSG IGSTFADRNF GIGMTCTQPA GTYDIALTFS
ATADSSGAPG VLAITQGASS ASGVGIQLLM NGSPVTFGAV LDAGSATAGA TLTIPMTARY
YQTGSVVTPG AANGIATFAV SYK
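
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count over the gene sequence. The sketch below shows one way to reproduce both from sequence blocks pasted out of this record. The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database tool. The codon assignments are those of the standard genetic code, which table 11 shares for elongation codons. The sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGCGATTGA AACGCGAACG GCGTGCGCGG"))  # prints: MRLKRERRAR
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the Protein sequence block and the GC content field.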