Gene Bcep18194_A3841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3841 
Symbol 
ID3749025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp748041 
End bp749975 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content68% 
IMG OID637762119 
Productanthranilate synthase component I/chorismate-binding protein 
Protein accessionYP_368084 
Protein GI78065315 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAG GTAACGAGAG CGCGTCGTTC GCGCTCTTGG ACGATTGCGA CTCGACCGCG 
CTCGCGCGGT CGAGTCGTTT GTATTCGGGG TTCGTGCGCG AACGTGTGTG CACGGATCCG
GCTCGACTCG ACGAGGTCGA CGCAGCCGTG GCGCAGGATC TGCGCGACGG GCTGCATGCG
GTCGTCGTCG GCGATTACGA ATTCGGACGC AATCTGCAAC GAGCGCAGCC GGGCCATGCC
CCGCTGCGCT TTTTGCTGTT TGCGCGCTGC GAGCGCCTGT CGCGGGACGA AGTCGACGCG
TGGCTCGCGC AGCGGGACGG CGGCGGCACG CCGTCGATCG CGGGCGTCGC GCATGTCGCG
AAGAGCGTGT CGCGCGATGC GTTCGACGTG GCGATCGCCG CGGTGCACGA CGCGCTGCGC
GCAGGCGATT CGTATCAGGT CAACTACACG TACCGGCTGA ACTTCGACGT GTTCGGCACG
CCGCTCGCGC TGTACCGGCG GCTGCGTGCG CGTCAGCCCG TGCGCTACGG TGCGCTGATC
GCGTTGCCCG ACGGCACGTG GGTCGTGTCG TGCTCGCCCG AGCTGTTCGT CGAGAAGTAC
GGCGACGTGC TGCGCGCGCG GCCGATGAAG GGCACCGCGC CACGTTCGGC CGACCCGCGC
GACGATGCGG CCGCGGCCAC GTTCCTTGCG AACGATCCGA AGAACCGCGC GGAAAACGTG
ATGATCGTCG ACTTGCTGCG CAACGACGTG TCGCGGATCG CGCGCACCGG GACGGTCCGC
GTGCCGGCGC TGTTCTCCGT CGAGCCGTAT GCGTCGGTGT GGCAGATGAC GTCGACGGTC
GAGGCCGGCT GGCGCGACGG AACGACGTTC GCGCAGATGC TGCGCGCGCT GTTTCCGTGC
GGATCGATCA CGGGCGCGCC GAAGCACAAG ACGATGCAGC TGATCGATGC GATCGAGTCG
ACGCCGCGCG GGCTCTATAC GGGCGCGATC GGCTGGCTTG ACGCTGCGAA ACAAGGCGCG
GATTCCGACG CGCCAGGTGA TCGCCTGGCA GGTTGCGGCG ATTTTTGCCT GTCGGTCGCG
ATCCGTACGT TGACGCTCGA TGCGGCCGGC GAAGGCGATG ATCGTGGAGG TGCAACGCGA
GCCGACGTCG AAGCACGCCA ACCGGCAACG GCAATCGCCG GCCGGCGCCG CGGCACGATG
GGTGTCGGCG CGGGCATCGT GCTCGACAGT GTCGCGGCCG ACGAATATGC GGAGTGCGAA
TTGAAAGCGC GATTCCTGAC GGATGCCGAT CCCGGCTTCC AGCTGTTCGA AACGACTGCC
GCCACGCGTG CGGACGGCAT ACGGCATCTC GATCGCCATC TCGCGCGGCT GCAGCGTAGC
GCGGATGCGT TCGGCTTCCG TTTCGACACC GATGCATTGC GTCGCGAGAT CGACGCGCGT
TGTGCGGCGC TCGACGGCGA CGGCGCATAC CGGATGAAGC TCTCGCTCGC GAAGGACGGC
ACGATCGAGA TCGTCGCGGC ACCGCTCAAG CCGCTGCCGG CGGGGCCGGT CGGCGTGCTG
CTGGCGTCCG CGCACGGCTT CGCACCGACC CGTACGAGCG ATGCGCTGCT GCTGCACAAG
ACCACACGCC GCGCCGAATA CGATCGCGCG TGGCAGGCGG CGGAGGCGCT TGGCGGCTTC
GACATGCTGT TCGTCAACGA GCGCGGCGAG GTGACGGAAG GCGGGCGCTC GAACCTGTTC
GTGAAGCTCG ACGGCCAGTG GGTGACGCCG CCGCTCGAGT CGGGCGTGCT GCCGGGCGTG
ATGCGCGGCG TGCTGCTCGA CGATCGTGCG TTCAGCGCGA CGGAGCGGGT CGTGACCCGC
GACGATCTCG CGCGTGCGGA GGCGCTGCTG CTGACCAACG CGCTACGCGG CGCGCTCGAC
GCGGTACTGA AGTGA
 
Protein sequence
MTEGNESASF ALLDDCDSTA LARSSRLYSG FVRERVCTDP ARLDEVDAAV AQDLRDGLHA 
VVVGDYEFGR NLQRAQPGHA PLRFLLFARC ERLSRDEVDA WLAQRDGGGT PSIAGVAHVA
KSVSRDAFDV AIAAVHDALR AGDSYQVNYT YRLNFDVFGT PLALYRRLRA RQPVRYGALI
ALPDGTWVVS CSPELFVEKY GDVLRARPMK GTAPRSADPR DDAAAATFLA NDPKNRAENV
MIVDLLRNDV SRIARTGTVR VPALFSVEPY ASVWQMTSTV EAGWRDGTTF AQMLRALFPC
GSITGAPKHK TMQLIDAIES TPRGLYTGAI GWLDAAKQGA DSDAPGDRLA GCGDFCLSVA
IRTLTLDAAG EGDDRGGATR ADVEARQPAT AIAGRRRGTM GVGAGIVLDS VAADEYAECE
LKARFLTDAD PGFQLFETTA ATRADGIRHL DRHLARLQRS ADAFGFRFDT DALRREIDAR
CAALDGDGAY RMKLSLAKDG TIEIVAAPLK PLPAGPVGVL LASAHGFAPT RTSDALLLHK
TTRRAEYDRA WQAAEALGGF DMLFVNERGE VTEGGRSNLF VKLDGQWVTP PLESGVLPGV
MRGVLLDDRA FSATERVVTR DDLARAEALL LTNALRGALD AVLK