Gene BURPS1106A_1020 details


Gene Information

Locus tag: BURPS1106A_1020
Symbol:
ID: 4901943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 996698
End bp: 997849
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 64%
IMG OID: 640134250
Product: hypothetical protein
Protein accession: YP_001065300
Protein GI: 126453481
COG category: [R] General function prediction only
COG ID: [COG0795] Predicted permeases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTCTCT ACGAAAAATA TTTCGCGCGG CAGATCTACG TCACGTTCAT CTTCGTGCTG 
TTCGCGTTTT CAGGGCTGTT CTTCTTCTTC GACCTGATCA GCGAACTGAA CTCGGTCGGC
CACGGCAACT ACAAGTTCGG CTACGCGGTG CTGCGCGTCG CGCTGCAGGC ACCGTCGCGC
TTCTATGAAA TCATCCCGGT CGCCGCGCTG ATCAGCGCGA TCTACGTATT CGCGCAGATG
GCCGCGAACT CGGAGTTCAC GATCTTCCGC GTGTCCGGCC TCGCGACGAA CCAGGCGCTG
CGCTCGCTCG TGAAGATCGG CGTGCCGATC GTCATCGCGA CCTACCTGAT CGGCGAATTC
ATCGGCCCGT ACTCGGATCA GCTGTCCGAG CGCGTGCGGC TCGAGGCGCT CGGCTCGTCG
GTGTCGACGA ACTTCGCGTC GGGCGTCTGG GTGAAGGACA CGCTCACCGC GCGCGACAAC
GGCGAGCCCG TCACGCGCTT CGTGAACGTC GGCACGCTGT CGCCCGACTC GACGATCAGC
GACGTGCGCA TCTACGAGTT CGATTCGAAG TTCAACCTGC AGAACGTGCG GATCGCGAAG
CGCGGCCACT ACCAGCCGCC CGGCCACTGG CTGCTGACGG ACGTCACCGA TACGCAGCTC
ACGAGCCTCG CGGGCAACGG CACCGCATCG CCCGTCGATA CGCTCAACCC CGTCTATCGC
GCGCAGCAGG TCACGCTGCC GCAGTATTCG CTGCGCTCGG ACCTGACGCC GCAGATCCTG
TCGGTGCTGC TCGTGTCGCC CGAGCGGATG TCGCTCTTCA ATCTGTTTCG CTACATCCAG
CATCTGAAAG AGAACCAGCA GGACACGCAG CGCTACGACA TCGCGCTGTG GCGCAAGCTG
CTGTATCCGT TCGCGGTGTT CGTGATGCTC GTGCTGTCGC TGCCGTTCGC GTACCTGCAC
ACGCGCGCGG GCGTCGTCGG CGTGAAGGTG TTCGGCGGGA TCATGCTCGG CATGAGCTTC
CAGCTCTTCA ACACGCTGTT CTCGCACATC GGTACGCTGA ACACGTGGCC CGCGCCGCTC
ACGGCCGCGC TGCCCGGCTG CATCTATCTC GCGCTCGGCC TCTTCGCGCT GAAGTGGGTC
GATCGGCACT GA
 
Protein sequence
MRLYEKYFAR QIYVTFIFVL FAFSGLFFFF DLISELNSVG HGNYKFGYAV LRVALQAPSR 
FYEIIPVAAL ISAIYVFAQM AANSEFTIFR VSGLATNQAL RSLVKIGVPI VIATYLIGEF
IGPYSDQLSE RVRLEALGSS VSTNFASGVW VKDTLTARDN GEPVTRFVNV GTLSPDSTIS
DVRIYEFDSK FNLQNVRIAK RGHYQPPGHW LLTDVTDTQL TSLAGNGTAS PVDTLNPVYR
AQQVTLPQYS LRSDLTPQIL SVLLVSPERM SLFNLFRYIQ HLKENQQDTQ RYDIALWRKL
LYPFAVFVML VLSLPFAYLH TRAGVVGVKV FGGIMLGMSF QLFNTLFSHI GTLNTWPAPL
TAALPGCIYL ALGLFALKWV DRH
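
Consistency check (illustrative sketch)

The lengths and sequences in this record can be cross-checked against each other. The Python sketch below is a minimal illustration and is not part of the database record. The helper names (clean, gene_length, gc_percent, translate) are hypothetical. It assumes 1-based inclusive coordinates, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ only in the start codons they allow).

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino-acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used for 10-letter grouping."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listed in this record.
first_line = "ATGCGTCTCT ACGAAAAATA TTTCGCGCGG CAGATCTACG TCACGTTCAT CTTCGTGCTG"
last_line = "GATCGGCACT GA"

print(gene_length(996698, 997849))  # 1152
print(translate(first_line))        # MRLYEKYFARQIYVTFIFVL
print(translate(last_line))         # DRH
```

Passing the full gene sequence to translate should give a protein that can be compared with the listed protein sequence, and gc_percent on the same input can be compared with the listed GC content. A 1152 bp coding sequence holds 384 codons, one of which is the stop codon, which agrees with the listed protein length of 383 aa.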