Gene BURPS1106A_1080 details

Gene Information

Locus tag: BURPS1106A_1080
Symbol:
ID: 4903062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1061631
End bp: 1062821
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 71%
IMG OID: 640134309
Product: putative hemY protein
Protein accession: YP_001065359
Protein GI: 126451474
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG3071] Uncharacterized enzyme of heme biosynthesis
TIGRFAM ID:
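
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1061631
END_BP = 1062821
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_match = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_match = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print("coordinates consistent with gene length:", coords_match)
print("gene length consistent with protein length:", protein_match)
```

If either check failed, the usual suspects would be a different coordinate convention (for example 0-based or half-open) or a spliced or partial gene, neither of which this record indicates.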


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTGC GTGGAATCAT TTGGCTCGCC GTGCTGTTCG CGATCGCCGC GGCGCTCGCG 
ACGGTCGGAC GCTTCGATAC CGGCCAGGTG CTGATCGTCT ATCCGCCGTA TCGCATCGAC
GTGTCGCTGA ACTTCTTCGT GCTCGCGATC ATCGTCGCGT TCATCGTGCT GTACGCACTG
ATGCGGATCG TGCGCAACGT CTGGCGGATG CCGCAGCGCG TGGCCGCGTA TCGCGCGCGG
ATGCGCAACG AGCGTGCGCA GGCCGCGTTG CGCGACGCGC TCGCGAACCT GTACGCGGGC
CGCTTCTCGC GCGCGGAGAA AGCCGCGCGC GACGCGCTCG CGGTCGACGC GAACCAGTCG
GCCGCGAGCC TCGTCGCCGC GGCCGCGACG CACCGGATGC ACGAGTATGC GCGGCGCGAC
GAGTGGCTCG CGAAGGTGAG CGGGCAGGAA TGGCAGGACG CGCGCCTGCT CGCGACGGCC
GACATGCGCG CGGACGGCCG CGACGCGGAG GGCGCGCTCG CCGCGCTCGC CGAGATGCAG
GCGTCGGGCG GCAAGCGGAT TCACGCGCAG CAGATCGCGC TGCGCGCGCA GCAGCAGAAC
AAGAACTGGG CCGAGGTGCT GAAGATCGCG AAGGCGCTCG AAAAGCGCGA GGCGCTGCAT
CCCGCGGCGG CCGTGCGCCT GCGCCAGCAG GCCGCCGAGC ATTTGCTGCG CGATCGCCGG
CACGACGCCG ATGCGCTGCT CGAGGTGTGG CAGTCGCTGT CGGCCGCCGA GCGGCAGTCG
CCGCGCCTCG CGGATCTCGC CGCCGAGCTG CTGATCGCGC TCGAGCGCCG GCAGGAAGCG
CGGCGCATCG TCGAGGACGC GCTCGCGCAC AACTGGAACG CGCGTCTGCT GCGCCGCTAT
CCGGATACGG CGGGTGCCGA CGCGCTGCCG CTGATCCAGA AGGCCGAGGG CTGGCGTCGC
GAGCGGCCGG ACGACGCGGA CCTGCTGTTC GCGCTCGGCC GCCTGTGCCA GCAGCAGCAA
CTGTGGGGCA AGGCGCAGTC GTTCCTCGAA TCGGCGCTGA AGCTGGCCGA CGACGAGCCG
CTCAGGATTC GCGCGCATCG TGCGCTCGCG CGCCTGTTCG AGCATCTGGG CGAGACCGAC
AAGGCCGCGC AGCACTATCG CGAAAGCGCG TTGGCGATCA CGGTCGTGTG A
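
The GC content reported for this gene can be recomputed from the sequence. The following is a minimal sketch, assuming GC content is defined as the count of G and C bases divided by the total sequence length; how the page itself rounds or computes the figure is not stated. The helper name `gc_fraction` is hypothetical. Only the first line of the gene sequence is used here for brevity, so the printed value describes those 60 bases and not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return gc_count / len(cleaned)


# First line of the gene sequence listed above (60 bases).
first_line = (
    "ATGACGCTGC GTGGAATCAT TTGGCTCGCC "
    "GTGCTGTTCG CGATCGCCGC GGCGCTCGCG"
)
print(f"GC fraction of first 60 bases: {gc_fraction(first_line):.1%}")
```

Passing the full 1191 bp sequence to the same function gives the gene-wide value to compare against the GC content field.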
 
Protein sequence
MTLRGIIWLA VLFAIAAALA TVGRFDTGQV LIVYPPYRID VSLNFFVLAI IVAFIVLYAL 
MRIVRNVWRM PQRVAAYRAR MRNERAQAAL RDALANLYAG RFSRAEKAAR DALAVDANQS
AASLVAAAAT HRMHEYARRD EWLAKVSGQE WQDARLLATA DMRADGRDAE GALAALAEMQ
ASGGKRIHAQ QIALRAQQQN KNWAEVLKIA KALEKREALH PAAAVRLRQQ AAEHLLRDRR
HDADALLEVW QSLSAAERQS PRLADLAAEL LIALERRQEA RRIVEDALAH NWNARLLRRY
PDTAGADALP LIQKAEGWRR ERPDDADLLF ALGRLCQQQQ LWGKAQSFLE SALKLADDEP
LRIRAHRALA RLFEHLGETD KAAQHYRESA LAITVV
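
The protein sequence can be reproduced by translating the gene sequence codon by codon. The sketch below is a hedged illustration in plain Python: it builds the standard codon table, which translation table 11 shares for all codons (table 11 differs from table 1 only in the set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical; only the first and last lines of the gene sequence are translated to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop, 'X' an unknown codon."""
    cleaned = "".join(seq.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(cleaned[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First and last lines of the gene sequence listed above; both start in frame
# because every preceding line holds 60 bases (a multiple of 3).
first_line = "ATGACGCTGC GTGGAATCAT TTGGCTCGCC GTGCTGTTCG CGATCGCCGC GGCGCTCGCG"
last_line = "AAGGCCGCGC AGCACTATCG CGAAAGCGCG TTGGCGATCA CGGTCGTGTG A"

print(translate(first_line))
print(translate(last_line))
```

The translated fragments should match the start and end of the protein sequence listed above, with a trailing `*` for the TGA stop codon that is not part of the 396 aa protein.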