Gene Bcep18194_A4661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A4661 
Symbol 
ID3749864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp1651733 
End bp1653064 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content64% 
IMG OID637762953 
ProductFlp pilus assembly protein secretin CpaC 
Protein accessionYP_368900 
Protein GI78066131 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4964] Flp pilus assembly protein, secretin CpaC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000794738 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00340482 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAGA AACTGATTGG ATTAGCTATT GCATTGAGCG CGCTGGCCAT GCCGCCGCTG 
TCGGGCGCGG CCGACACGAG CGGGACGATC AGCCTGGCGG TCGGCGCACA ACGGCAGCTC
GCGACGGGGC GTGCGCTGCA GCGGGTTGCG GTCGGCGATC CGTCGGTCGC CGACGTGATG
GTCGTGAAGG GCGGGCGTGG CGGTGTCCTG CTGATAGCGA AGAGCGCCGG CGCGACGAAC
GTGATGATCT GGGAGCGCGG TCACGACGAA CCGACCGTCT ACAACGTGAG CGTCACGAGC
GGCGCGGCGC GTGCGTTGCT TGACGGTGGG ACGTCGAGCG TGAATACGTA CGGCGGCACG
ACGGTGATCG GCGGTGCGTC CGGGTCGCTC GACGGGCATC AGCGGGCGGT TCATGCGGCG
AAGACGGTCA GCGGCAAGGA CGGGACGACG ATCGATGCGT CGACGATCTC CGGCAAGAAC
GTCGTGCAGG TCGATGTGCG CGTCGTCGAA TTCAGCCGTT CGGTGCTGAA GCAGGCAGGG
CTGAACTTCT TCAAGCAAAG CAACGGTTTT TCGTTCGGTG CCTTCGCGCC GACCGGGCTC
ACGTCGATTA CCGGTTCGCC GGGCGGTGCG CTTACGTACA ACACGAACGT GCCCATTTCG
TCAGCGTTCA ATCTGGTCGT CAATTCGGTG TCGCACGGGT TGTTCGCCGA TCTGTCGATC
CTCGAAGCCA ACAATCTCGC CCGCGTGCTG GCACAGCCGA CGCTCGTCGC GTTGTCCGGC
CAGAGCGCGA ATTTCCTGGC GGGTGGTGAA ATTCCGGTGC CGGTGCCGCA ATCGCTGGGC
ACGATCTCGA TCGAATGGAA GCCTTACGGC GTGGGGCTGA CCGTCACGCC GACCGTGTTG
AACCCGCGCC GGATCGCGCT GAAGGTCGCG CCCGAGTCGA GCCAGCTCGA CTTCGTGCAC
TCGATCACGA TCAACGGTGT TCAGGTGCCC GCGCTGACGA CCCGGCGCGC GGATACGACG
GTCGAGCTCG GCGATGGCGA AAGCTTCGTG ATCGGCGGTT TGATCGATCG CGAGACGACC
TCGAACGTCA ACAAGGTGCC GTTCCTGGGC GATCTGCCCA TCATCGGCGC GTTTTTCAAG
AATTTGAGCT ATCAGCAGAG CGACAAGGAA CTCGTGATCA TCGTAACGCC GCATCTCGTT
GCGCCGATCG CGCAGGGCGC GTCGCTGCCG GCGACTCCGG GAGAACTGTC CGAGCAGCGC
GACGGCCCCG TGTGGCGTTC CTATCTGGGC GGCATGGCGT CGCCGGATGC AGGGCCGGGG
TTCTCGAAAT GA
 
Protein sequence
MKKKLIGLAI ALSALAMPPL SGAADTSGTI SLAVGAQRQL ATGRALQRVA VGDPSVADVM 
VVKGGRGGVL LIAKSAGATN VMIWERGHDE PTVYNVSVTS GAARALLDGG TSSVNTYGGT
TVIGGASGSL DGHQRAVHAA KTVSGKDGTT IDASTISGKN VVQVDVRVVE FSRSVLKQAG
LNFFKQSNGF SFGAFAPTGL TSITGSPGGA LTYNTNVPIS SAFNLVVNSV SHGLFADLSI
LEANNLARVL AQPTLVALSG QSANFLAGGE IPVPVPQSLG TISIEWKPYG VGLTVTPTVL
NPRRIALKVA PESSQLDFVH SITINGVQVP ALTTRRADTT VELGDGESFV IGGLIDRETT
SNVNKVPFLG DLPIIGAFFK NLSYQQSDKE LVIIVTPHLV APIAQGASLP ATPGELSEQR
DGPVWRSYLG GMASPDAGPG FSK