Gene Bcep18194_A3857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3857 
Symbol 
ID3749041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp765430 
End bp767280 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content58% 
IMG OID637762135 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_368100 
Protein GI78065331 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.211188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGTT CACCGACGAT CCATACGCAC TCTTGGCCGA ATGGCATCAT GCCACGCAAA 
AAAGGGCCGG CGCTCTCGTG GTTCACCACT TCCCTCGAAT CTGAAACGGC CGATGGATGG
ATCGGCCGCA TCGATGCCGA ACTCGCGACC TTATCCGCAA TCGGCTCGGC ATTGAATATC
ACGCGGCTCG TGGAGCGATT CCGTGCCGCC AACGTATTCG ATCCATCGAG CTGCGCCACG
CGTATTCCGC TCGGCTTGCT CGAGCGCGGC ACCGCACAGC GAGTGCTGGT GCTCGACGAG
CCCGCATCCG TCTATTTGCA GGCACCGCGC AGAGAAAGGG AAAGGCAGTT TTCCCGAATG
CTGGCCGCAG TGCACCGCGA GCAACCAGAG GCAGAAATTT GGTTCGCCCG TAGTGGCATA
TCGAGTTCAG GAGAATGGCT TTCGTCGTCC CATCCAGGAA TCCGCGGTTC GAATCGATAC
ATCGACACAA GCGAGTCACT CTGCGCTTCC CTCCCGTATT TCGACCACGT CTATACGCTC
TCCGCAGTTG AAGGGATGCA AGCGTTGCTT TGTGGCGTCC CGGTACATGT ATTCGGTATG
CCGTACTACG CTGGCTGGGG ATTGACGCAC GACGATGCGC CCCAGCCGGC AAGGCAATCG
CACGCGACTC TGGAATCATT GTTCGAAGTT GTGTTCATAC GCCTTTCCCG TCATCTCGCC
CCTGCAGGAA AGACATCCGA TTCGCTCGAA GCACTCCTTG ACGCAATCGA AGCTCATCGC
GCAACCGTAT TACGCTTCGC CGATATACGC CACGTAGCAG GCATCCGTTT CCAGTGGTGG
AAACGACCCT TCGCTACTCC TTTCCTGACG GCCGGTGGCG GAACACTGCG CTGGACGGAT
GACGCAAGCA AGCTTGCGGA AGGAGAGCAT GCCGCATTTT GGGGTGCGCG TAGTACAGAG
GGCTTGCCAC CCGACACACC GGCCGTCCGT ATCGAGGACG GCTTCCTGCA CTCGATCGGC
CTCGGCTCAG ACCACGTCGC TCCATGCAGC CAGATAATCG ACCGGCGCGG CCTCTATTTC
GACCCGAGCC GCCCGAGCGA CCTGACGGTG ATCCTGAACG AAACGGATTT CAATGAAACT
GAACTCGCGC GGGCTGACGC ATTGCGCAAC GAGATCACCC GCCTAGGGTT GACAAAGTAC
AATCTCGGTC GCCGCAAGCC GGCTTGGCAC GCGCCGCCGG GCAAGCGCGT TGTGCTCGTG
CCGGGACAGG TCGCAGACGA CGCGTCGATT CGACTCGGTA CGCGCGGCAT CACGACAACC
GAAGAATTGC TGCGAACGGT ACGCGCCCAC AACCCGGACG CATTTATTGT CTATAAACCC
CATCCTGACG TCCTGTCGGG CAATCGCCGC GGCTTAATAG AAGCGGCAGC TCTCGCCGAC
GTGATGGAGC AGGACTCTGA TCTGATTTCG CTGATTGAGA TAGCGGATGA AGTGCACACG
CTTTCTTCGC TGTCTGGTTT TGAAGCGTTG ATTCGCCGGA AAGCTGTCTT CACATATGGA
CTGCCCTTTT ATGCAGGCTG GGGGTTGACA CACGATGCGC TCGCACCACC TTGGCGCGAT
CGCAAGCTCT CGCTCGATAT GCTGACGGCA GGTGTATTGC TGCGCTACCC GATCTATTGG
GATTGGACTC TTCATCTGTT TACATCGCCC GAAGCCATCG TTCGAAAATT GGCGATACCT
GCGAAACGCC CACTCGTGAA AATTCGAGGT AATCGTTTGC GCCCACTTCT AAAAGCAATT
CGTTGGAGCA GGAATGGACT CCAGCACCTC GCATGGCGTT GCAGTCAATG A
 
Protein sequence
MSRSPTIHTH SWPNGIMPRK KGPALSWFTT SLESETADGW IGRIDAELAT LSAIGSALNI 
TRLVERFRAA NVFDPSSCAT RIPLGLLERG TAQRVLVLDE PASVYLQAPR RERERQFSRM
LAAVHREQPE AEIWFARSGI SSSGEWLSSS HPGIRGSNRY IDTSESLCAS LPYFDHVYTL
SAVEGMQALL CGVPVHVFGM PYYAGWGLTH DDAPQPARQS HATLESLFEV VFIRLSRHLA
PAGKTSDSLE ALLDAIEAHR ATVLRFADIR HVAGIRFQWW KRPFATPFLT AGGGTLRWTD
DASKLAEGEH AAFWGARSTE GLPPDTPAVR IEDGFLHSIG LGSDHVAPCS QIIDRRGLYF
DPSRPSDLTV ILNETDFNET ELARADALRN EITRLGLTKY NLGRRKPAWH APPGKRVVLV
PGQVADDASI RLGTRGITTT EELLRTVRAH NPDAFIVYKP HPDVLSGNRR GLIEAAALAD
VMEQDSDLIS LIEIADEVHT LSSLSGFEAL IRRKAVFTYG LPFYAGWGLT HDALAPPWRD
RKLSLDMLTA GVLLRYPIYW DWTLHLFTSP EAIVRKLAIP AKRPLVKIRG NRLRPLLKAI
RWSRNGLQHL AWRCSQ