Gene BURPS668_2862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2862 
Symbol 
ID4884976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2818496 
End bp2819647 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content68% 
IMG OID640128790 
Productradical SAM domain-containing protein 
Protein accessionYP_001059881 
Protein GI126439549 
COG category[L] Replication, recombination and repair 
COG ID[COG1533] DNA repair photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.22952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC GATACGACGT CGAATACCCG GTGATGCCGC CCGCGCCCCG TAAGGGGCGC 
GGCGCGTTGG GCAACCTGCA GGGGCGCTAC GAGAGCGTCG AGCGCGAGGC GGTCAACGAC
GGCTGGACGC GCGACGGCGA GCCGCGCGCG CCGCTGCGCA CGCAGGTGTT CGAGGAGCGC
GCCAGGACGA TCCTCACGCG CAATGCGTCA CCCGACATTC CGTTCAATGT ATCGCTGAAT
CCGTATCGCG GCTGCGAGCA CGGCTGCATC TACTGCTTCG CGCGGCCGAC GCACAGCTAT
CTCGGGCTGT CGCCGGGGCT CGATTTCGAA AGCCGGATCT ACGCGAAGGT GAATGCCGCG
GAGTTGCTCG CGCGCGAACT CGCGAAGCCG CGCTACGTGC CCGAGCCGAT CGCGCTCGGC
GTGAATACGG ACGCGTATCA GCCGGTCGAG CGCGAACGGC GGATCACGCG GCAGGTGATC
CAGGTGATGC ATGACCACGG TCAGCCGTTT GCCGCGATCA CGAAGTCGTC GCTGATCGAG
CGTGATCTCG ATCTGCTCGC GCCGATGGCC GAGCGCCGGC AGGTGATGGC GGCCGTCACG
ATCACGACGC TCGATCCCGA GCTCGCGCGC GCGCTCGAGC CGCGCGCCGC GACGCCCTCG
CGCCGGCTGC GGACGATCCG CGCGCTGCGC GACGCGGGGG TGCCGGTCGG CGTGAGCATC
GCGCCGATGA TCCCGTTCGT CACCGAACCG GATCTCGAGC GCGTGCTCGA GGCGTGCGCG
GACGCGGGGG CGACGCACGC GAGCTATATC GTGTTGCGAT TGCCGTGGGA AGTCGCGCCG
CTTTTCACCG AATGGCTCGC CGCGCATTTT CCGGATCGCG CGGAGCGCGT GATGGCGCGT
GTGAGGGACA TGCGCGGCGG CAAGGATTAC GACGCGGATT TCAGCCGCCG GATGAAAGGC
GAAGGAATGT GGGCCGAGTT GCTCAAGCAG CGCTTCCGGA TGGCGGTCAA GCGCTGCGGG
CTGAACGAAC GCGCGCGGGG AATTCTCGAT TTTTCGCAGT TTTGCGCGCC GCGACGGTCG
AAACCGCCGC CGCCCGTGCG ACCGCGTGCG GCGGCGCAAA CCGGAGACCA GCCGCAGCTC
AGTCTGTTCT GA
 
Protein sequence
MSERYDVEYP VMPPAPRKGR GALGNLQGRY ESVEREAVND GWTRDGEPRA PLRTQVFEER 
ARTILTRNAS PDIPFNVSLN PYRGCEHGCI YCFARPTHSY LGLSPGLDFE SRIYAKVNAA
ELLARELAKP RYVPEPIALG VNTDAYQPVE RERRITRQVI QVMHDHGQPF AAITKSSLIE
RDLDLLAPMA ERRQVMAAVT ITTLDPELAR ALEPRAATPS RRLRTIRALR DAGVPVGVSI
APMIPFVTEP DLERVLEACA DAGATHASYI VLRLPWEVAP LFTEWLAAHF PDRAERVMAR
VRDMRGGKDY DADFSRRMKG EGMWAELLKQ RFRMAVKRCG LNERARGILD FSQFCAPRRS
KPPPPVRPRA AAQTGDQPQL SLF