Gene Bcep18194_A5652 details

Gene Information

Locus tag: Bcep18194_A5652
Symbol:
ID: 3750880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 2750131
End bp: 2751411
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 68%
IMG OID: 637763968
Product: O-antigen polymerase
Protein accession: YP_369890
Protein GI: 78067121
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
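
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2750131, 2751411)
print(length)                  # 1281
print(protein_length(length))  # 426
```

Both values match the Gene Length and Protein Length fields of this record.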


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCGT TTTCCGCTCC CGCCTCGCGG CGCCTGACCG CCGCCCGTGC ATTCGCCGTT 
GCCGCGCTCT GCATGGTGCC GGTCTCGACC GCGCTGACCA ACGTGTTCTG CGGGCTGTTC
GCCGCCGCGC TCGTGATTTC CCCCGAGTTC TGGCGCGACC TGCGCTCGTT CGTCACCGAC
CCGGCCTCGC TCGCGGCGCT GCTGATCCTG GCCGCGCTCG CCGCCAGCGT CACTTATACG
GTTGCACCGC ACAACAAGGC GTGGAACTGG GTCGCCAAGT ACGACAAGCT GCTGCTGCTG
CCGTTCGCCG TGCTCGCCTT CCGCCATTCG AACTGGGCAC CGATCGTCCG CCGTTGCTGG
TTCGGCACGC TGTGCGTGAT CCTGCTGTTG TCGACCACGA ACTATCTCGG CCTGACCGCG
ATCGGGCCCG CGCACGCGAC CGAACTGCCG CTGTCGCGCG CGTGGGTGTT CAAGAACCAC
ATCGCCGCCG GCATGTTCGG CGCGCTGCTG TTCTACCAGG CGGCCGATCT CGCGCTGGCG
GCCCGCACGG CGCTGTCGCG CGCCGCGTAT GCAGGCGTCG CCGCGTGGTC GCTCGTCAAC
GTGTTCGTGA TGCTGCAGGG ACGCACCGGG CAGGTCATCG CGCTGCTGCT GATCCTCGTC
GTCGCCGTAC GTTTCGTGCT GTTGCTGCGC CGGCAATCGG CGTTGCGCGC GGCGCTCGCC
GCCGGCGTGT TCGTGCTGGC CGGCATCGCG CTCGTGATCG CCGCATGCAC GGTTCACAAC
GGCCGGCTGA CGAAGGTCGT GACGGAAGTG CAGCAATACC GGCAGAGCGA TGCGGCCACG
TCCACCGGGC TGCGCCTCGA GTGGTACAAG AAGGGGCTCG AGCTGTTTCG CCAGCGCCCG
GTGATCGGCT ACGGCGCAGG CGGCCTCGAA TCCGAATTCG AGAAGCTCAC GGCCGGCAAG
ACGGCGGCCG AAGGCCAGCT CACGTCGAAC CCGCACAATG AATACCTGCT GATGGCCGTA
CAGCTCGGCG CGGTCGGCCT GCTGCTGTTC ATCAACCTGA TCGTGCAGAT TGCACGCGGC
AGCCGCACGC TCGATCCGCG CTCGCGGCAT CTGCTGCTCG CCTGGCTCGC GATCTTCGCA
ATCGGCAGTC TCGCGAATTC GCTGCTGCTC GATTTCGCCG AAGGGCACCT GATCGTGCTG
CTGGCCGGCA TCCTGCTCGG CTGCGGCGAA CGCGCCGAGG CGCTGCCGCG CGAAACGTCG
GCGATCCGGC GCAGCGCGTA A
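
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from such text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display row of the gene sequence, copied from this record.
first_row = "ATGCTTTCGT TTTCCGCTCC CGCCTCGCGG CGCCTGACCG CCGCCCGTGC ATTCGCCGTT"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this row only
```

Passing the full 1281 bp sequence to `gc_fraction` would give a value to compare with the 68% reported in the Gene Information section.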
 
Protein sequence
MLSFSAPASR RLTAARAFAV AALCMVPVST ALTNVFCGLF AAALVISPEF WRDLRSFVTD 
PASLAALLIL AALAASVTYT VAPHNKAWNW VAKYDKLLLL PFAVLAFRHS NWAPIVRRCW
FGTLCVILLL STTNYLGLTA IGPAHATELP LSRAWVFKNH IAAGMFGALL FYQAADLALA
ARTALSRAAY AGVAAWSLVN VFVMLQGRTG QVIALLLILV VAVRFVLLLR RQSALRAALA
AGVFVLAGIA LVIAACTVHN GRLTKVVTEV QQYRQSDAAT STGLRLEWYK KGLELFRQRP
VIGYGAGGLE SEFEKLTAGK TAAEGQLTSN PHNEYLLMAV QLGAVGLLLF INLIVQIARG
SRTLDPRSRH LLLAWLAIFA IGSLANSLLL DFAEGHLIVL LAGILLGCGE RAEALPRETS
AIRRSA
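
The protein sequence follows from the gene sequence by codon-wise translation. As a sketch under stated assumptions: translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so the standard codon table is used below. Alternative start codons are not handled specially, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGCTTTCGT TTTCCGCTCC C"))   # MLSFSAP
print(translate("GCGATCCGGC GCAGCGCGTA A"))   # AIRRSA
```

The two outputs agree with the first seven and last six residues of the protein sequence listed above.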