Gene BURPS668_A3055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3055 
Symbol 
ID4886239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2901425 
End bp2902585 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content63% 
IMG OID640132991 
Productradical SAM domain-containing protein 
Protein accessionYP_001064046 
Protein GI126442706 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.615551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTATTC CGCTGCTCCA GCAAGTCCGC GTTGGCGCAT ACATCATGCG CCAGCACCTG 
TCCGGCAACA AACGCTATCC GCTCGCGCTG ATGCTCGAGC CCCTCTTCCG CTGCAACCTC
GCGTGCAACG GCTGCGGCAA GATCGACTAT CCGGATCCGA TCCTGAACCA GCGCCTGTCC
GTCGAGGAAT GCCTGCAGGC CGTCGACGAG TGCGGCGCGC CCGTGGTGTC GATCGCGGGC
GGCGAGCCGC TGCTGCACAA GGAAATGCCG GAAATCGTCA AGGGCATCAT GAAGCGCAAG
AAGTTCGTCT ACCTGTGCAC GAACGCGCTG CTGATGGAAA AGAAGATGGA CGATTACGCG
CCGAGCCCGT ACTTCGTCTG GTCGGTCCAT CTCGACGGCG ACCGGGAGAT GCACGATCAC
TCGGTGTCGC AGGAAGGCGT GTACGACAAG GCCGTCGCGG CGATCCGCGA AGCGAAGCGC
CGCGGCTTCC GCGTGAACAT CAACTGCACG CTGTTCAACG ATGCGCTCCC CGAACGCGTC
GCGAAGTTCT TCGATACGCT GGGGCCGATC GGCGTCGACG GCATCACCGT GTCGCCGGGC
TACGCGTACG AGCGCGCGCC GGATCAGCAG CACTTCCTGA ACCGCGACAA GACGAAGAAC
CTGTTCCGCG AAGTCTTCAA GCGCGGCGAA GGCGGCAAGC GCTGGTCGTT CAGCCAGTCG
TCGCTGTTCC TCGATTTCCT CGCCGGCAAC CAGACGTACA AGTGCACGCC GTGGGGCAAC
CCGGCGCGCA CGGTGTTCGG CTGGCAGAAG CCGTGCTACC TGGTCGGCGA AGGCTACGTG
AAGACCTTCA AGGAGCTGAT GGAATCGACC GACTGGGACA ACTACGGCGT CGGCAACTAC
GAAAAGTGTG CGGACTGCAT GGTCCACTGC GGCTTCGAGG CCACCGCCGT GATGGATACG
ATCGCGCATC CGCTGAAGGC GCTGAAGGTG TCGATGAGCG GCATCCGGAC CGAAGGCGCG
TTCGCGCCGG ATATTCCGAT CGACAACCAG CGTCCGGCCG AGTATGTGTT CTCGCGCCAC
GTGGAAATCA AGCTCGAGGA GATCCAGCGC GCGGGCAAGG GCAAGCTGCA GAAGGCGCCG
AAGCCCGCCG CGACGGCCTG A
 
Protein sequence
MSIPLLQQVR VGAYIMRQHL SGNKRYPLAL MLEPLFRCNL ACNGCGKIDY PDPILNQRLS 
VEECLQAVDE CGAPVVSIAG GEPLLHKEMP EIVKGIMKRK KFVYLCTNAL LMEKKMDDYA
PSPYFVWSVH LDGDREMHDH SVSQEGVYDK AVAAIREAKR RGFRVNINCT LFNDALPERV
AKFFDTLGPI GVDGITVSPG YAYERAPDQQ HFLNRDKTKN LFREVFKRGE GGKRWSFSQS
SLFLDFLAGN QTYKCTPWGN PARTVFGWQK PCYLVGEGYV KTFKELMEST DWDNYGVGNY
EKCADCMVHC GFEATAVMDT IAHPLKALKV SMSGIRTEGA FAPDIPIDNQ RPAEYVFSRH
VEIKLEEIQR AGKGKLQKAP KPAATA