Gene BURPS1106A_A2395 details


Gene Information

Locus tag: BURPS1106A_A2395
Symbol:
ID: 4904699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2372063
End bp: 2373175
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 63%
IMG OID: 640145500
Product: Rieske family iron-sulfur cluster-binding protein
Protein accession: YP_001076427
Protein GI: 126457482
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
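
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the last codon is a stop codon that is not counted as a residue. Both points are assumptions made for this illustration, not statements from the record, and the variable names are illustrative only. A minimal sketch of the arithmetic:

```python
# Values copied from the Gene Information fields above.
start_bp = 2372063
end_bp = 2373175

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not counted
# as an amino acid residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the script reproduces the 1113 bp and 370 aa reported in the record.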


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGAATGT CCAATCTGAG CGACGCACTG CAGCTGAAGT CGGCACATAG CCAGCTTCCC 
GTCACCGCTT ATTTCGATGA GGCGCTCCTC GCGCGCGAAA TCGAAACACT TTTCAAGAAA
GGACCTCGCT ATGTCGGGCA CGAATTGATG GTGCCCGAAG CAGGAGATTA TTTTGCGCTG
CCTTCCGAAG ACGAAGGCCG CGTGCTGGTG CGCAACCAGG CTTCGCAGAT CGAGCTGCTG
TCGAACGTGT GCCGCCACCG CCAGGCGATC ATGCTGAACG GCCGCGGGCG TACGCAGAAC
ATCGTCTGCC CGCTGCATCG CTGGACCTAC GATCTCGAAG GCCAGTTGCT CGGCGCGCCG
CACTTTCCGG ACAAGCCCTG CCTGAACCTG CACGCGACGC CGCTGCAGCA CTGGCAAGGG
CTGCTGTTCG AGGCCGAGGG CCGCGATGTC GCGCACGATC TCGCGCAACT CGGCACGAAG
CACCATTTCG ACTTTTCGGA CTACCTGTTC GATCACGTCG AGATCCACGA GTGCAATTAC
AACTGGAAGA CCTTCATCGA GGTCTACCTC GAGGACTACC ACGTCGTGCC GTTCCATCCG
GGCCTCGGCA GCTTCGTGTC GTGCGACGAC CTGAAGTGGG AATTCGGCGA CTGGTACAGC
GTGCAGACGG TGGGCGTGCA CAACGCGCTC GCGAAGCCGG GCAGCCCGAC GTACCAGAAG
TGGCACGATC AGGTGCTCCG TTATCGCAAC GGCGTGCCGC CGGAGTTCGG CGCGATCTGG
ATGGTCTATT ACCCGGGCCT CATGATCGAG TGGTATCCGC ACGTGCTCGT GGTGTCGTGG
CTGATTCCGC GCGGCCCGCA GAAGACGACG AACATCGTCG AGTTCTACTA CCCCGAGGAA
ATCGCGCTGT TCGAGCGCGA GTTCGTCGAG GCGGAGCGCG CCGCCTATAT GGAGACCGCG
ATCGAGGACG ACGAGATCGC ATGGCGCATG GACGCCGGCC GCCGCGCGCT AATGGAGCGC
GGCGAATCGC AGGTCGGCCC GTATCAGAGC CCGATGGAAG ACGGCATGCA GCACTTCCAC
GAGTTCCTGC GCCGGCAACT CGGCGCGATC TGA
 
Protein sequence
MGMSNLSDAL QLKSAHSQLP VTAYFDEALL AREIETLFKK GPRYVGHELM VPEAGDYFAL 
PSEDEGRVLV RNQASQIELL SNVCRHRQAI MLNGRGRTQN IVCPLHRWTY DLEGQLLGAP
HFPDKPCLNL HATPLQHWQG LLFEAEGRDV AHDLAQLGTK HHFDFSDYLF DHVEIHECNY
NWKTFIEVYL EDYHVVPFHP GLGSFVSCDD LKWEFGDWYS VQTVGVHNAL AKPGSPTYQK
WHDQVLRYRN GVPPEFGAIW MVYYPGLMIE WYPHVLVVSW LIPRGPQKTT NIVEFYYPEE
IALFEREFVE AERAAYMETA IEDDEIAWRM DAGRRALMER GESQVGPYQS PMEDGMQHFH
EFLRRQLGAI
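
The protein sequence begins with M although the gene sequence begins with GTG. This is consistent with the Translation table field (11, the bacterial, archaeal and plant plastid code): table 11 assigns the same amino acids as the standard code but lists GTG among its initiation codons, and an initiation codon is read as methionine. The sketch below checks this on the first 60 and last 33 nucleotides of the gene sequence above. The `translate` helper and its `is_cds_start` parameter are hypothetical names written for this example, not part of any library, and the helper treats only the first codon as a possible start.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Initiation codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_cds_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks above) are ignored. If
    is_cds_start is True and the first codon is a table 11 initiation
    codon, it is emitted as M.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_60 = "GTGGGAATGT CCAATCTGAG CGACGCACTG CAGCTGAAGT CGGCACATAG CCAGCTTCCC"
last_33 = "GAGTTCCTGC GCCGGCAACT CGGCGCGATC TGA"

print(translate(first_60))                     # MGMSNLSDALQLKSAHSQLP
print(translate(last_33, is_cds_start=False))  # EFLRRQLGAI
```

Both outputs match the corresponding ends of the protein sequence listed above. Away from the start position GTG still encodes valine, which is why the helper applies the start-codon rule to the first codon only.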