Gene BURPS1106A_A1672 details


Gene Information

Locus tag: BURPS1106A_A1672
Symbol:
ID: 4904478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1641497
End bp: 1642558
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 72%
IMG OID: 640144777
Product: putative regulatory protein
Protein accession: YP_001075705
Protein GI: 126456840
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
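
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length; both are assumptions about the record, not statements taken from it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1641497
end_bp = 1642558
gene_length_bp = 1062
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which contributes no residue to the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks print True for the values listed in this record.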


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAG CGCGCATCAA AGCAGCGGCC GCCGATATGC CGGAACGCGC CCGCCTGCGC 
ATCGGCTTCG TCGCGCTGAG CGACGCCGCG CCGCTCGTCG CCGCACAGCG GCTCGAGCTC
GGCGCACGCT ACGGCTTGAC GCTCGAGCTC TGCCGGCAGC CGTCGTGGGC GAGCATTCGC
GACAAGCTGC TGTCGGGCGA GCTCGACGCC GCGCACGCGC TGTACGGGCT CGTCTACGGC
GTGCAGCTCG GCATCGGCGG GCCGCGCGCC GACCTGGCGG TGCCGATGGT GCTGAACCGC
AACGGCCAGG CGATCACGTT CTCGAACCGG CTCGCCGACG CGTACCGCGC GTCGGGCGAG
CTGAAGGCCG CGCTCGCGAC ACTCGGCCGG CGCCCCGTGT TCGCGCAGAC GTTCCCGACC
GGCACGCATG CGATGTGGCT GTATCACTGG CTCGCGTCGC ACGGCGTCGA TCCGCTGCAC
GATGTCCGCA GCGTCGTGAT TCCGCCGCCG GAGATGGTGG ACGCACTCGC GGCGGGCGAA
CTCGACGGGC TGTGCGTGGG CGAGCCGTGG AATGCGGTCG CCGAGGCGCG CGGCGCGGGC
AGGACGGTCG CGGCGACGAG CGAAGTGTGG CCCGACCATC CGGAAAAGGC GCTCGCGTGC
CGGCGCGAGT TCGTCGCGCT GTATCCGAAT ACGGCGCGCC TGCTGGTGCG CACGCTGCTC
GATGCGTGCG AATGGCTCGA CGACGCGGAC CACCGAATGA AGGCGGCCGC ATGGCTGGCG
GAGCCGGACG CGATCGGCGT GCCGATCGGG CAGATCGCGC CGCGGCTGCT CGGCGACTAC
GGCGCGGGGC CGTTTGCGCA GCCGCCCGCG CCGATCAAGT TCTACGAGCA CGGAACGGTG
AATCGGCCGG CCGCGAGCGA TGGGATGTGG TTCCTGTCGC AGTATCGGCG CTGGGGGATG
CTGAGCGGCG ACGTCGACGA TGCGGCGATC GCGAACGGCG TCGCGCACAC GGCGCTCTAC
GACGAAGCGG TCGCGCTCGG AGGGGCACGA CGCGGCGAGT GA
 
Protein sequence
MNTARIKAAA ADMPERARLR IGFVALSDAA PLVAAQRLEL GARYGLTLEL CRQPSWASIR 
DKLLSGELDA AHALYGLVYG VQLGIGGPRA DLAVPMVLNR NGQAITFSNR LADAYRASGE
LKAALATLGR RPVFAQTFPT GTHAMWLYHW LASHGVDPLH DVRSVVIPPP EMVDALAAGE
LDGLCVGEPW NAVAEARGAG RTVAATSEVW PDHPEKALAC RREFVALYPN TARLLVRTLL
DACEWLDDAD HRMKAAAWLA EPDAIGVPIG QIAPRLLGDY GAGPFAQPPA PIKFYEHGTV
NRPAASDGMW FLSQYRRWGM LSGDVDDAAI ANGVAHTALY DEAVALGGAR RGE
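
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses only the first 60 bases of the gene sequence as listed in this record and a codon table built from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in the residue assigned to each codon). The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers written for this example, not part of any database tooling.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence, copied from this record.
first_line = "ATGAACACAG CGCGCATCAA AGCAGCGGCC GCCGATATGC CGGAACGCGC CCGCCTGCGC"

print(translate(first_line))    # compare with the first 20 residues of the protein
print(gc_fraction(first_line))  # GC fraction of this 60-base prefix only
```

The translated prefix can be compared with the first two blocks of the protein sequence above (MNTARIKAAA ADMPERARLR). The GC fraction printed here covers only the 60-base prefix, so it is not expected to equal the whole-gene GC content field.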