Gene BURPS1710b_2044 details


Gene Information

Locus tag: BURPS1710b_2044
Symbol: cpaF
ID: 3689881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 2210839
End bp: 2212404
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 71%
IMG OID: 637728500
Product: component of type IV pilus
Protein accession: YP_333439
Protein GI: 76810428
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
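
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2210839
END_BP = 2212404


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1566
print(protein_length(length_bp))  # 521
```

Under those assumptions the results agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields.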


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.101478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGGCC GGCGTCCGGC CGCTTCATTC GCGCCCGGCG ACGCCGGCGG CGCGCACGAG 
TTCGCGCCCG ACGCCGCGCC GGCGGGTGCG GCGCCGGCCG GCGGCCACGC ATCGGCGCCC
GACGCGAGCG GCGCGGCACG CGCGCGCGAG CCGGCCGGCG TATCGGGCGC GAGCGCGCCC
GGCGGGGCGC CGCAGTCCGG CGTCGCGCCA AGCGGCCATC GGCCCGCATC GCGTGACGAC
CGCATCGACC GCATCGATCG CGATGACGGC CACGACCGCG CGCCCGCGCA AGACCACCAC
GAGGCGCTGA TCCGCTCGGA GACGTTCAAG ACGATCCGCG CGGTCGTGTT CTCGTCGATG
AACATGTCGG CCGCGCTGAT GATGTCGCGC GCCGAAGTGC GCGAAGGCAT CGAGCAGGCG
GCCGCCGACG TGATCGCGCG CGAGCGGCTG ACGGTGACGC TCGCCGAGCA GGCGCTCATC
GTCGACGAGA TCCTCAACGA CATGTTCGGC GTCGGGCCGA TCGAGCCGTT GCTCGCCGAC
GAACGCGTGA CCGACATCCT CGTCAACGGC CCCGATCAGG TGTACGTCGA GCGCGCCGGC
AAGCTCGAGC TCACGCCGCT GAAGTTCCGC GACAACGCGC ACGTGATCAA CGTCGCGCAG
CGGATCGCGG CGGCGGTCGG GCGGCGCGTC GACGAGAGCA GCCCGATGGT CGACGCGCGG
CTCGCGGACG GCAGCCGCGT GAACGTCGTG CTGCCGCCGA TCGCGATCCG CGGCGCGTCG
ATCTCGATCC GCAAGTTCGC CAAGCGCAAC ATCACGCTCG CGCGGATGGC GCAGCAGGGC
AACCTGTCGC AGGCGATGGT CGAGGTGCTG AAGATCGCGA GCGCGTGCCG GCTGAACATC
GTGATCTCGG GCGGCACGGG CTCCGGCAAG ACGACGCTGC TGAACGCGCT GTCGCACTTC
ATCGATTCGC ACGAGCGCAT CGTGACGATC GAGGACGCCG CGGAGCTGCA ATTGCAGCAG
CCGCACGTCG TGAGCCTCGA GACGCGCCCG GAGAACAGCG AGGGGCTGGG CGGCGTGTCG
CAGCGCGATC TCGTGCGCAA CGCGCTGCGC ATGCGCCCCG ATCGCATCAT CCTCGGCGAG
ACGCGCGGCC CGGAGGCGTT CGACGTGCTG CAGGCGATGA ACACCGGGCA CGACGGCTCG
ATGACGACGA TCCATGCGAA CACGCCGCGC GATGCGATCA CGCGCCTCGA GAGCATGGTG
ATGATGGCCA ACGGCAACCT GCCGCTCGTG TCGATCCGCC GGCAGATCGC GAGCGCGGTG
CACATGATCC TGCAGATCGA GCGCATGCGC GACGGCGTGC GGCGCGTCAC GCGCGTGACC
GAGATCGCCG GCATGGAGGG CGATGTCGTG ATCACGCAGG ATCTGTTCGC GTTCCGCTAC
GACGCGAGCG CGTTCCAGGA GCAGGTGCAC GGAATGTTCG AATCGTCGTC GCTGCGCCCG
GCGTTCGCGC AGCGCGCCGC GTATTACGGC CTCGAGGGCG CGCTGCTCGG CGCGTTGCAG
CCGTGA
 
Protein sequence
MFGRRPAASF APGDAGGAHE FAPDAAPAGA APAGGHASAP DASGAARARE PAGVSGASAP 
GGAPQSGVAP SGHRPASRDD RIDRIDRDDG HDRAPAQDHH EALIRSETFK TIRAVVFSSM
NMSAALMMSR AEVREGIEQA AADVIARERL TVTLAEQALI VDEILNDMFG VGPIEPLLAD
ERVTDILVNG PDQVYVERAG KLELTPLKFR DNAHVINVAQ RIAAAVGRRV DESSPMVDAR
LADGSRVNVV LPPIAIRGAS ISIRKFAKRN ITLARMAQQG NLSQAMVEVL KIASACRLNI
VISGGTGSGK TTLLNALSHF IDSHERIVTI EDAAELQLQQ PHVVSLETRP ENSEGLGGVS
QRDLVRNALR MRPDRIILGE TRGPEAFDVL QAMNTGHDGS MTTIHANTPR DAITRLESMV
MMANGNLPLV SIRRQIASAV HMILQIERMR DGVRRVTRVT EIAGMEGDVV ITQDLFAFRY
DASAFQEQVH GMFESSSLRP AFAQRAAYYG LEGALLGALQ P
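
The sequence blocks above are printed in groups of ten characters, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon table is built from the standard assignments, which translation table 11 shares for amino acids (the two tables differ only in permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an ungapped DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three groups of the gene sequence, copied from the record.
first_groups = clean("ATGTTCGGCC GGCGTCCGGC CGCTTCATTC")
print(translate(first_groups))  # MFGRRPAASF
```

Pasting the full gene sequence into `clean` and passing the result to `translate` and `gc_fraction` would let the Protein sequence and GC content fields be checked the same way.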