Gene BURPS1106A_0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0749 
Symbol 
ID4900877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp728472 
End bp729770 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content66% 
IMG OID640133979 
Productcytochrome c family protein 
Protein accessionYP_001065031 
Protein GI126454256 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCA AGTCCCTGTT TGCACTCTCG GCTGTCGCGA TCGTCGCGGC AGCGGCTCTC 
GTGCCCGTCC TGTGGCCGGG CAACGACACG CTGCACGGCA ACGCCGCCGT CGCCGCGACG
CCCGCCGATC AGGCCGCGCT CATCAAGAAG GGCGAATACC TCGCGCGCGT CGGCGACTGT
ATCGCGTGCC ACACCGTGCG CGGCGGCAAG CCGTTCGCGG GCGGCCTGCC GATGGCCACG
CCGTTCGGCA CGATGTACAC GCCGAACATC ACGCCGGACG ACCAGGCCGG CATCGGCAAG
TGGACGTCGG ACGATTTCTA CCGCGCGATG CACACGGGCC GCTCGAAGGA CGGCAGCCTG
CTCTATCCGG GCTTCCCGTT CGCGAGCTAC ACGAAGGTCA CGCGCGCGGA TTCGGACGCG
ATCTACGCGT ACCTGCGCTC GGTCGCGCCC GTGAGCACGC CGAGCCGTCC GCACGAGCTG
CGCTTCCCGT TCAACAACCG CAACCTGCTG ATCGGCTGGC GCACGCTGTT CTTCAAGGAA
GGCGAGTACA AGCCGGACCC GACGAAGTCG GTCGAATGGA ACCGCGGCGC GTATCTCGTC
GAAGGCCTCG GCCATTGCTC GATGTGCCAC ACGTCGATCA ACATGATGGG CGGCCCGGTG
AGCTCGGCGG CCTTCGCGGG CGGCCTGATT CCGCTGCAGA ACTGGTACGC GCCGTCGCTC
ACGAACGACA AGGAGCTCGG CCTCGGCGAC TGGCATGTGC AGGAGCTGTC CGATCTGCTG
CAGGCGGGCG TGTCGCACAA GGGCGCGGTG TTCGGCCCGA TGGCGGACGT CGTCCACAAC
AGCCTGCAAT ACATGACGGA CGAGGACACG CGTGCGATGT CGACTTACCT GAAGTCGATC
CCGCAGAAGG CCGAAGCGCC GAAGAACATG CAGTACGAGC CGTCCAAGCA GTTCGGCACG
GCGCTGCTCG AGCAAGGCAA GAAGATCTAT GCCGACAACT GCGCGACCTG CCACGGCCCG
CAGGGCGAAG GCAAGCCGAC CGCTTACCCG CCGCTCGCGC AGAACCGTTC GATCATGATG
GAATCGGCCG TCAATCCGAT CCGCATGGTG CTGAACGGCG GCTATCCGCC CAGCACGTTC
AAGAATCCGC GTCCGTACGG GATGCCCCCG TTCGCGCAGT CGCTGTCGAA TCAGGAAGTC
GCGGCGGTCG TCACGTACAT CCGGATGTCG TGGGGCAACA ACGGTTCGCC GGTCTCGCCG
CAACAGGTGA GCGACCTGCG TTCCGCACCG CTCGACTAA
 
Protein sequence
MKRKSLFALS AVAIVAAAAL VPVLWPGNDT LHGNAAVAAT PADQAALIKK GEYLARVGDC 
IACHTVRGGK PFAGGLPMAT PFGTMYTPNI TPDDQAGIGK WTSDDFYRAM HTGRSKDGSL
LYPGFPFASY TKVTRADSDA IYAYLRSVAP VSTPSRPHEL RFPFNNRNLL IGWRTLFFKE
GEYKPDPTKS VEWNRGAYLV EGLGHCSMCH TSINMMGGPV SSAAFAGGLI PLQNWYAPSL
TNDKELGLGD WHVQELSDLL QAGVSHKGAV FGPMADVVHN SLQYMTDEDT RAMSTYLKSI
PQKAEAPKNM QYEPSKQFGT ALLEQGKKIY ADNCATCHGP QGEGKPTAYP PLAQNRSIMM
ESAVNPIRMV LNGGYPPSTF KNPRPYGMPP FAQSLSNQEV AAVVTYIRMS WGNNGSPVSP
QQVSDLRSAP LD