Gene BURPS668_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2030 
Symbol 
ID4881771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2024812 
End bp2025783 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content65% 
IMG OID640127958 
Producttype-1 fimbrial protein, A subunit 
Protein accessionYP_001059065 
Protein GI126438966 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTGA AACGCGAACG GCGTGCGCGG CGCCCCCGTC TGCATGCGGC GGCAGGCGGC 
CTGTGGCTTG TCGTTGCCGC GCTGCCGCCC GGCGCACGGG CAGCGACTTG CGAAGGCGAC
AAGACACTCG TCACGCTTCC GGCCATCGCC GTCGCGGCCG ATGCGCCGGT GGGAACGGTG
CTGTGGAGTC AGAAGGGGAT CGCTTTCAGC ACCTATTGCA CGTTGGGATG GTTCGACACC
AGCAACATTT ATGTCTGGCG CGCCGACCTG CGCTCGACGC TGCAGCAATA TGGGCTGACG
TTCTGGTTGA CTTACGGAGG GCAGGGCGGC AACACCGCGC TGCAAATCAA GGAGCCGATG
GTCGTCGACC TTGGCGGAAA GGCCGGCTAT GCGAGCGGCT CCGTCGGCCT GGAACTGAGG
AAGACGGGCA TGACGCCGGC GCAAGGCGTC GTCAGCGCGG CGGACATCCC CGCGTTCTAT
CTCGATAGCA ATACGAACTA CAACAAAGGC TCGCACTACA TCCGCGGGCT GACCAACATT
TCGTTCGTCT CCTATACCTG CGACATCGAT ACGGGATCGC GCAGCATGAA CGTGCCGCTC
GGCGACGTGC GCGTCGATCG CTTCAGCGGC ATCGGCTCCA CCTTCGCGGA TCGGAATTTC
AGCATCGGCA TGACGTGCAC GCAGCCGGCC GGCACGTACG ATATCGCGCT GACGTTTTCC
GCGACGGCGG ACAGCTCCGG CGCGCCGGGC GTGCTCGCGA TCACGCAAGG GGCGTCTTCC
GCGTCCGGAG TCGGCATTCA GTTGCTGATG AACGGCTCGC CGGTGACTTT CGGCACCGTC
CTCGACGCGG GCAGCGCGAC CGCGGGCGCG ACGCTGACGA TCCCGATGAC GGCACGCTAT
TATCAGACCG GCCGTGTCGT GACGCCGGGC GCGGCGAACG GGATCGCGAC ATTCGCCGTC
AGCTACAAAT GA
 
Protein sequence
MRLKRERRAR RPRLHAAAGG LWLVVAALPP GARAATCEGD KTLVTLPAIA VAADAPVGTV 
LWSQKGIAFS TYCTLGWFDT SNIYVWRADL RSTLQQYGLT FWLTYGGQGG NTALQIKEPM
VVDLGGKAGY ASGSVGLELR KTGMTPAQGV VSAADIPAFY LDSNTNYNKG SHYIRGLTNI
SFVSYTCDID TGSRSMNVPL GDVRVDRFSG IGSTFADRNF SIGMTCTQPA GTYDIALTFS
ATADSSGAPG VLAITQGASS ASGVGIQLLM NGSPVTFGTV LDAGSATAGA TLTIPMTARY
YQTGRVVTPG AANGIATFAV SYK