Gene Bcep18194_A3630 details


Gene Information

Locus tag: Bcep18194_A3630
Symbol:
ID: 3748808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 504183
End bp: 505889
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 67%
IMG OID: 637761904
Product: phenylacetic acid degradation protein paaN2
Protein accession: YP_367875
Protein GI: 78065106
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02288] phenylacetic acid degradation protein paaN
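
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 504183
END_BP = 505889


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # prints "1707 568"
```

Under these assumptions the computed values agree with the reported gene length (1707 bp) and protein length (568 aa).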


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000209323
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.623158
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCATG CACTGTTCAC GAAGCACGAA GACACGCTGA AACACGCACT CGCCGCCATC 
GAGAGCCGCG GGTACTGGAG CCCGTTCGCC GAAATGCCGA GCCCCAAAGT GTACGGGGAA
AGCGCGAACG CAGATGGCGA GGCCGCGTTC AAGTCGCACC TCGACAAGAC GTTCGCGCTC
GACCAGCCAG CGTCCGGAGA AACGGTCGGC GCAGAGCGGT CGCCGTACGG CGTTGCACTG
GGCATCCGGT ATCCGAAGTC GACGCCCGAC GAACTGATCG CCGCCGCTGC CGCCGCGCAA
CGCTCGTGGC GCGAAGCCGG CCCGAGCGCC TGGATCGGCG TCAGCCTCGA AATCCTCGCG
CGCCTGAATC GGGCCAGCTT CGAAATCGCC TACAGCGTGA TGCACACCAC GGGGCAGGCA
TTCATGATGG CGTTCCAGGC CGGCGGCCCG CACGCGCAAG ATCGCGCGCT CGAGGCAGTC
GCCTATGCAT GGGACGAACT GCGCCGCATC CCCGCCGAAG CACACTGGGA AAAGCCGCAA
GGCAAGAACC CGCCGCTCGC GATGCACAAG CGCTATACGA TCGTCCCGCG CGGCACCGGG
CTCGTGCTCG GCTGCTGCAC GTTCCCGACC TGGAACGGCT ACCCCGGCCT GTTCGCCGAT
CTCGCAACCG GCAACACCGT GATCGTCAAA CCGCATCCGG GCGCGATCCT GCCCCTCGCC
GTCACCGTCC GGATCGCTCG CGACGTGCTG CGCGAAGCCG GTTTCGATCC GAACGTCGTC
ACGCTCCTCG CGACCGAACC CAACGACGGC GCACTCGTTC AGGATCTCGC GCTGCGCCCC
GAGATCAAGT TGATCGACTT CACCGGCAGC ACGCAAAACG GCACGTGGCT CGAACGCCAT
GCTCACCAGG CACAGGTCTA TACGGAGAAG GCTGGCGTCA ACCAGATCGT GATCGACTCG
ACCGACGACC TGAAGGCCGC CGCCAAAAAC ATCGCGTTCT CGCTGGCGCT GTACTCCGGC
CAGATGTGCA CGGCACCGCA GAACATCTAC GTGCCGCGCG ACGGCATCCG GACGGCGGAC
GGCCATGCGA GCTTCGACGA GGTCGCACAG GCAATCGCCC TTTCCGTGCA AAAACTCACC
GGCGACCCGG CACGCTCGGT CGAACTGATC GGCGCAATCC AGAACGAAGG CGTGACCGCA
CGCATCGACG ATGCCCGCCA GCTCGGCCGC GTGCTCGCCG ACAGCCTGAC CCTCCAACAC
CCTGCGTTCC CCGATGCCCG CGTGCGCACG CCGCTCGTGC TGCAACTCGA CGTGGCCGAT
CGCGAGAAAT TCACGCAGGA ATGGTTCGGC CCGATCTCGT TCGTGATCGC GACCGACTCG
ACCGTGCAAT CGCTCGACCT CGCCGGGGAA ATCGCGGCGG AACATGGCGC GCTGACGCTC
TCCGTCTACA GCACCGACGA TGCGATCGTC GACGCCGCCC ACGACGCGGC GGTGCGCGGT
GGCGTCGCGC TGTCGATCAA CCTGACGGGC GGTGTGTTCG TGAACCAGTC GGCAGCGTTC
TCGGATTTCC ACGGCACGGG CGCGAACCCT GCAGCCAACG CGGCGCTGGC CGACCCGGCG
TTCGTCGCCA ACCGGTTCCG CGTGGTGCAA AGCCGCGTCC ATGTTGCCCC GAAGGCTGCA
CCTGTGGAAG CCGGCCAGCC GGCATAA
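
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before its length or base composition can be checked. Here is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCCATG CACTGTTCAC GAAGCACGAA GACACGCTGA AACACGCACT CGCCGCCATC"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60: six blocks of 10 bases
print(round(gc_fraction(seq) * 100))  # GC percentage of this line only
```

Passing the full sequence text through the same two functions should give a length and GC percentage that can be compared with the Gene Length and GC content fields of the record.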
 
Protein sequence
MTHALFTKHE DTLKHALAAI ESRGYWSPFA EMPSPKVYGE SANADGEAAF KSHLDKTFAL 
DQPASGETVG AERSPYGVAL GIRYPKSTPD ELIAAAAAAQ RSWREAGPSA WIGVSLEILA
RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWDELRRI PAEAHWEKPQ
GKNPPLAMHK RYTIVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNTVIVK PHPGAILPLA
VTVRIARDVL REAGFDPNVV TLLATEPNDG ALVQDLALRP EIKLIDFTGS TQNGTWLERH
AHQAQVYTEK AGVNQIVIDS TDDLKAAAKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAD
GHASFDEVAQ AIALSVQKLT GDPARSVELI GAIQNEGVTA RIDDARQLGR VLADSLTLQH
PAFPDARVRT PLVLQLDVAD REKFTQEWFG PISFVIATDS TVQSLDLAGE IAAEHGALTL
SVYSTDDAIV DAAHDAAVRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANAALADPA
FVANRFRVVQ SRVHVAPKAA PVEAGQPA
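
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, assuming the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code) and ignoring the alternative start codons that table 11 allows; this gene begins with ATG, so that simplification does not affect it. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering of each base position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGACCCATG CACTGTTCAC GAAGCACGAA"))  # prints MTHALFTKHE
```

The output matches the first 10 residues of the protein sequence above. Characters other than A, C, G, and T raise a `KeyError` in this sketch; a production tool would handle ambiguity codes explicitly.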