Gene Bphyt_1472 details

Gene Information

Locus tag: Bphyt_1472
Symbol:
ID: 6282503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010681
Strand:
Start bp: 1649259
End bp: 1650263
Gene length: 1005 bp
Protein length: 334 aa
Translation table: 11
GC content: 64%
IMG OID: 642621038
Product: Aminocarboxymuconate-semialdehyde decarboxylase
Protein accession: YP_001895111
Protein GI: 187923469
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
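
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 1649259
end_bp = 1650263
gene_length_bp = 1005
protein_length_aa = 334


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions the record is internally consistent: the span covers 1005 bp, which is 335 codons, or 334 residues once the stop codon is excluded.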


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTCCC ACTTCTTTCC GCCGATCGCG CGGGAAGAGG CCGCACGTCT CGACGCGCAA 
CACGCGCCGT GGCTGCAGAT CGACGCGGAC GGCGAACGCG GCATGATCAT GACTGGCGAG
AAGCGTTTTC GCCCGGTGTA TCGCGCGCTG TGGGACCCGG CGGCGCGGAT TGCCGAAATG
GACGCGCTCG GCGTCGACAT TCAATTGATG TGCGCGACAC CCGTGATGTT CGGCTATGGC
TATGGAGCGG CCGCCGCGCA CGATTGGGCG GCGCGCATGA ACGACCACGC GCTAGAACTC
TGCGCACACG CACCGCAGCG CTTGATGGCG CTGGCCCAGG TGCCGCTGCA GGATGTCGAG
CTGGCTTGCC GCGAAGCGAC GCGCGCGCAT CGCGCCGGCC ATCGCGGCGT GCAGATCGGC
AATCACCTCG GCCCGCGCGA TCTCGACGAC GAACATCTCG TGACCTTCCT CACGCACTGC
GCGAACGAAG GCATTCCCGT ACTCGTGCAT CCGTGGGACA TGATGACCGA CGGCCGCATG
AAGAAATGGA TGCTGCCGTG GCTGGTCGCC ATGCCGGCGG AAACGCAATT GAGCATGGTG
TCGCTGATTC TGTCGGGCGC GTTCGAGCGG ATTCCGAAGT CGCTCAAGTT GTGCTTCGCG
CATGGCGGCG GCAGCTTTGC GTTTCTGCTC GGCCGTGTGC AGAACGCGTG GGAGCAGCGC
GACATCGTGC GTGAAGATTG CCCCAATCCA CCAGTGTCGT ACCTCGAACG CTTTCACGTG
GACAGCGCCG TGTTCGACGA AGGCGCGTTG CGTCTGCTCG TCGAAACGAT GGGCGAGGAC
CATGTGCTGC TCGGTTCCGA CTATCCGTTC CCGCTCGGCG AACTGAAGAT CGGCGACCTC
GTCGCGCATC ATCCGCAACT GAGCGAGACG GCCAAGGCCA AGATCCTCGG CGCTAACGCA
CAGCGATTCT TCGGCCTGCC GGTGAACATC GCGCACGCTT CTTAA
 
Protein sequence
MHSHFFPPIA REEAARLDAQ HAPWLQIDAD GERGMIMTGE KRFRPVYRAL WDPAARIAEM 
DALGVDIQLM CATPVMFGYG YGAAAAHDWA ARMNDHALEL CAHAPQRLMA LAQVPLQDVE
LACREATRAH RAGHRGVQIG NHLGPRDLDD EHLVTFLTHC ANEGIPVLVH PWDMMTDGRM
KKWMLPWLVA MPAETQLSMV SLILSGAFER IPKSLKLCFA HGGGSFAFLL GRVQNAWEQR
DIVREDCPNP PVSYLERFHV DSAVFDEGAL RLLVETMGED HVLLGSDYPF PLGELKIGDL
VAHHPQLSET AKAKILGANA QRFFGLPVNI AHAS
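
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides codon by codon. The sketch below is a minimal standard-library example, not the pipeline that produced this record. It assumes the sequence is given 5' to 3' on the coding strand, and it builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which this sketch does not model). The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; stop codons are marked "*".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-nucleotide blocks of the gene sequence listed above.
first_blocks = "ATGCATTCCC ACTTCTTTCC GCCGATCGCG"
print(translate(first_blocks))  # MHSHFFPPIA
```

The first 30 nucleotides translate to MHSHFFPPIA, matching the first block of the protein sequence. Passing the full gene sequence to `translate` would extend the same check to the whole record.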