Gene BURPS1106A_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1420 
Symbol 
ID4900888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1393356 
End bp1394372 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content72% 
IMG OID640134650 
ProductLacI family transcriptional regulator 
Protein accessionYP_001065693 
Protein GI126455315 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCCA CCATCAAAGA CGTCGCCGCG CTCGCCGGCT TTTCGATCGC CACCGTGTCG 
CGCGCGATCA ACGCGCCGCA CACCGTCCAT CCGGCGACGC TCGAAAAGAT CCGCGCGGCG
ATCGGCGCGC TGCGCTTTCG CCCGAATCCG CTCGGCCGGC AATTGCGCAG CGACCGCACG
CAATTGATCG GCGTCGTGCT GCCGACGCTC GCGAATCCCG TGTTCGCCGA ATGCCTGCAG
GGCGTCGACG AACTCGCGAC GCAGGCCGGC TTCAAGCTGA TCGTGATGTC GACCGAATAC
GATGCGGCGC GCGAGCGCCA TGCGATCGAG ACACTGCGCG CGCAGCGCGT GGAAGGGCTG
ATGCTCACCG TCGCCGACGC CGACGCGCAC CCGCTGCTCG ACGAGCTCGA CCGCGACGGC
CCGCTCTACG TGCTGATGCA CAACGACACG CCGCATCGCC CGTCGGTGGC GGTCGACAAT
CGCCGCGCCG CGTACGACGG CGTGCGGATG CTGATCGAGC GCGGCCATCG GCGCGTGCTG
ATGCTCGCGG GCTCGCTCGA CGCATCCGAT CGCGCGCGGC TGCGCGTGCA CGGCTATGCG
CAGGCACTCG ACGAGCGCGG GCTCGAACCG CTGCCCGCGC TCGAGCTCGA CTTCAATGCA
CCCGCGCTGC CGCACGCGAT GCTCGCGCAT CTGAGCGCGC GCGCGACGCG CCCCACCGCG
CTCTTCTGCA GCAACGACTG GCTCGCGATG GTCGTGATTC GCGGGCTGCG CGACGCGCAC
CTCGCGGTGC CCGACGACAT GTCGGTGCTC GGCTTCGACG GCCTCGCGGT CGGCGAGCTG
CTCGCGCCGC CGCTCGCGAG CGTCGCGACG CCGAATCGCG AGATCGGCCG CGCCGCGTGG
CGGCGCCTTG CCGAGCGCAT CGCCGGCAAG CGCCATGCGC AACCCGCGCT GACGCTGCCG
CACGCGGTGC GCGACGGCGC GACCGTCGCG CCGCCGCGCG ATGCGCGCAT CGCCTGA
 
Protein sequence
MTPTIKDVAA LAGFSIATVS RAINAPHTVH PATLEKIRAA IGALRFRPNP LGRQLRSDRT 
QLIGVVLPTL ANPVFAECLQ GVDELATQAG FKLIVMSTEY DAARERHAIE TLRAQRVEGL
MLTVADADAH PLLDELDRDG PLYVLMHNDT PHRPSVAVDN RRAAYDGVRM LIERGHRRVL
MLAGSLDASD RARLRVHGYA QALDERGLEP LPALELDFNA PALPHAMLAH LSARATRPTA
LFCSNDWLAM VVIRGLRDAH LAVPDDMSVL GFDGLAVGEL LAPPLASVAT PNREIGRAAW
RRLAERIAGK RHAQPALTLP HAVRDGATVA PPRDARIA