Gene BURPS668_1174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1174 
Symbol 
ID4883833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1151754 
End bp1152815 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content74% 
IMG OID640127102 
Producthypothetical protein 
Protein accessionYP_001058223 
Protein GI126438901 
COG category[R] General function prediction only 
COG ID[COG2962] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGTCCA TCGCCGATTC CTCTCGCGCC GCGTTGCGCG GCGTGCTGTA CGTTGCGTTG 
TCGGCTGTCG CGTTCGGCGC GATGGCGATC TTCGGCCGCT ACGCGTACGC GGGCGGCGCC
GACGTGCTCG GCCTCCTGAT CGTTCGCTTT TCGATCGCGG GGGCGCTGCT CGTCGCCGTC
GCGCGCCGCC GCCGCGTGCG CTGGCCGCGC GGCCGCGCGC TCGCCGCGAT CGTCGGCATG
GGCGCGCTCG GCTATGTCGG CCAGTCGCTG TGCTATTTCA GCGCACTGCA ACACGCGCAG
GCGAGTCTCG TCGCGCTGCT GCTCTATCTA TACCCGGCGT TCGTCGCGCT GCTTGCCGCC
TGGTGGCTCG GCGAGCGGCT CACGCGCGCG AAGGCCGTTG CGCTCGCGCT GTGCGTCGCC
GGTTCGGCGC TGATGGTGGG CGGCGGCCGC GGCGAGCCGC TCGGCATCGC GCTCGCGCTC
GGCGCGGCCG TCGTCTACTC ACTGTATATC GTCGTCGGCG CGAAGGCGGC GCGCGGCGTC
GATCCGCTCG CGACCGTCGC GGTCATTTGT TGCGCCGCGG CCGCGATGCT CGCCATGCTC
GCGCTCGCGC GGGCAGCGGC GTTCGACGCG CCGCCGCATT GGCCGCGCGC GGCGGCCGGC
TGGGCGGCGC TCGTCGCGAT CGCGCTCGTG TCGACCGTCG CCGCGATGCT CGCGTTCTTC
GCCGGTCTCG CGCGGCTTGG CGCGGCCCGC ACGTCGATGC TCTCGACGCT CGAGCCCGTC
GTGACAGTCG CGCTTGCCGC CGCGTTGTTC GGCGAGACGC TGACGCCGCT GCAATGGGCG
GGCGGCGTCG CGATCCTGGC GGCGGTATTG TGGCTCGTGC GCGCGGGCGA CGCAGCCGAT
TCGCGCGGAG CCGGCGACGA TCGCGAGCGT CGCCGGCTCG GGCGGCGAGA TGACGAGCCG
AGTGCGCCGG GCGGGAGCGG GGCCGGCGGC GGGCCGGCTG GCTTCGTCGA TCCGAACGAA
TGCGGAATCC GGCGCGTACG GAGCGCGGAC GAGAACGCGT GA
 
Protein sequence
MPSIADSSRA ALRGVLYVAL SAVAFGAMAI FGRYAYAGGA DVLGLLIVRF SIAGALLVAV 
ARRRRVRWPR GRALAAIVGM GALGYVGQSL CYFSALQHAQ ASLVALLLYL YPAFVALLAA
WWLGERLTRA KAVALALCVA GSALMVGGGR GEPLGIALAL GAAVVYSLYI VVGAKAARGV
DPLATVAVIC CAAAAMLAML ALARAAAFDA PPHWPRAAAG WAALVAIALV STVAAMLAFF
AGLARLGAAR TSMLSTLEPV VTVALAAALF GETLTPLQWA GGVAILAAVL WLVRAGDAAD
SRGAGDDRER RRLGRRDDEP SAPGGSGAGG GPAGFVDPNE CGIRRVRSAD ENA