Gene Bcep18194_A3621 details


Gene Information

Locus tag: Bcep18194_A3621
Symbol:
ID: 3748799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 494821
End bp: 496314
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 65%
IMG OID: 637761895
Product: anthranilate synthase component I
Protein accession: YP_367866
Protein GI: 78065097
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
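
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(494821, 496314)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Both results agree with the Gene Length and Protein Length fields in this record.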


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC TCGAATTTCA ATCGCTCGCG AACGAAGGCT ACAACCGCAT CCCGCTGATC 
GCGGAAGCCC TCGCCGATCT CGAAACGCCG CTGTCCCTCT ACCTGAAGCT GGCCCAGCCC
GAACGCTCGG GCGCCAACTC GTTCCTGCTC GAATCGGTGG TCGGCGGCGA ACGCTTCGGC
CGCTATTCGT TCATCGGCCT GCCCGCCCGT ACGCTCGTGC GTACCCGCAA CGGCGTGTCG
GAAGTCGTGC GCGACGGCCA GGTCGTCGAG ACGCATGACG GTGACCCGTT CCAGTTCATC
GAATCGTTCC AGGCGCGCTT CAAGGTGGCG CAGCGCCCGG GCCTGCCGCG CTTCTGCGGC
GGTCTCGCCG GCTATTTCGG CTACGACGCG GTGCGTTACA TCGAGAAGAA GCTCGCGAAC
ACCACGCCGC GCGACGATCT CGGCCTGCCC GATATCCAGT TGCTGCTGAC CGAGGAAGTG
GCGGTCATCG ACAACCTCGC CGGCAAGCTG TACCTGATCA TCTACGCCGA CCCGAGCCAG
GCCGAAGCCT ATACGAAGGC GAAGCAGCGC CTGCGCGAAC TGAAGCAGCG CCTGCGCACG
ACCGTGCAGC CGCCGGTCAC GTCGGCGAGC GTGCGCACCG AGACGTTCCG CGAGTTCAAG
AAGGACGACT ATCTGGCCGC CGTGCGCCAG GCGAAAGAAT ACATCGCGGC CGGCGAGCTG
ATGCAGATCC AGGTCGGCCA GCGCCTGACG AAGCCGTATC GCGACAATCC GCTGTCGCTG
TATCGCGCAC TGCGTTCGCT GAACCCGTCG CCGTACATGT ACTACTACAA CTTCGGCGAT
TTCCACGTGG TCGGCGCATC GCCGGAAATC CTCGTGCGCC AGGAAAAGCG TGGCGAAGAC
CAGATCGTCA CGATCCGCCC GCTGGCCGGC ACGCGCCCGC GCGGCAACAC GCCCGAGCGC
GACGCGGAAC TCGCGACCGA GCTGCTCAAC GATCCGAAGG AGATCGCCGA GCACGTGATG
CTGATCGACC TCGCGCGCAA CGACGTCGGC CGCATCGCGG AAATCGGCTC GGTGCACGTG
ACCGACCAGA TGGTCATCGA GAAATACTCG CACGTGCAGC ACATCGTCAG CTCGGTCGAA
GGCAAGCTGA AGCCCGGCAT GACGAACTAC GACGTGCTGC GCGCGACGTT CCCGGCCGGC
ACGCTGTCCG GTGCGCCGAA GGTCCGTGCG ATGGAACTGA TCGACGAACT CGAGCCGGTC
AAGCGCGGGT TGTACGGCGG CGCCGTCGGC TACCTGTCGT TCTCGGGCGA GATGGATCTC
GCGATCGCGA TCCGCACGGG CCTGATCCAC AACGGCAACC TGTACGTGCA AGCGGCGGCC
GGCGTCGTCG CGGATTCGGT GCCCGAATCC GAATGGCAAG AGACCGAGAA CAAGGCGCGC
GCAGTGCTGC GTGCGGCCGA ACAGGTCCAG GACGGCCTCG ATAGCGACTT CTGA
 
Protein sequence
MTELEFQSLA NEGYNRIPLI AEALADLETP LSLYLKLAQP ERSGANSFLL ESVVGGERFG 
RYSFIGLPAR TLVRTRNGVS EVVRDGQVVE THDGDPFQFI ESFQARFKVA QRPGLPRFCG
GLAGYFGYDA VRYIEKKLAN TTPRDDLGLP DIQLLLTEEV AVIDNLAGKL YLIIYADPSQ
AEAYTKAKQR LRELKQRLRT TVQPPVTSAS VRTETFREFK KDDYLAAVRQ AKEYIAAGEL
MQIQVGQRLT KPYRDNPLSL YRALRSLNPS PYMYYYNFGD FHVVGASPEI LVRQEKRGED
QIVTIRPLAG TRPRGNTPER DAELATELLN DPKEIAEHVM LIDLARNDVG RIAEIGSVHV
TDQMVIEKYS HVQHIVSSVE GKLKPGMTNY DVLRATFPAG TLSGAPKVRA MELIDELEPV
KRGLYGGAVG YLSFSGEMDL AIAIRTGLIH NGNLYVQAAA GVVADSVPES EWQETENKAR
AVLRAAEQVQ DGLDSDF
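
The sequence blocks above are printed in space-separated groups of ten characters. The following is a minimal sketch of how such a block might be cleaned, checked for GC content, and translated. It assumes plain uppercase A/C/G/T input and uses the amino-acid assignments of the standard genetic code, which translation table 11 shares. It does not handle the alternative start codons that table 11 permits, and all function names are hypothetical.

```python
# Amino acids for the 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon and drop one trailing stop codon, if present."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three groups of the gene sequence above.
fragment = clean_sequence("ATGACCGAAC TCGAATTTCA ATCGCTCGCG")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # 0.5
print(translate(fragment))    # MTELEFQSLA
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence through the same functions would allow the GC content and Protein Length fields to be checked against the record.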