Gene Information |
Locus tag | BURPS1106A_A1894 |
Symbol | |
ID | 4905001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1858480 |
End bp | 1859931 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640145000 |
Product | hypothetical protein |
Protein accession | YP_001075928 |
Protein GI | 126455619 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
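The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1858480
end_bp = 1859931
reported_gene_length_bp = 1452
reported_protein_length_aa = 483


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Under these assumptions both comparisons hold for this record, which is consistent with the gene sequence starting with ATG and ending with the stop codon TGA.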
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTCTCAA TGCTCTATTT TCCGATGGTA TCGGCTCTGA GCTTGCTCGG TGCCGACGCG CCGACGCACT TGCATTCGCA CTTGAAGTTG ATTCTGGGCG GCGAGTTCAA TGCGGCGCTC GAAAGATCGA GCGAATGGGC CGAAACGACG GTGGCATCGG AGCGGACATC ATGGGATCTG CAATTGCACG CGGATCTGCA GCTGGTGCTT GGCTTCGAAG TCGAAGCCGA AGAAAACTAT CGGCGCGCCC AGCGAAAAAT TCGCGGCTCA AACAGTAAGA TTCGCATCGC GACCTGCCGG AACGCCGCGT GGCAAGCCCT GTTCCGCTAC CGGGTCACGA CCGCGCTCGC GTGTTTTTCC CGAATCTGCG ACGAGCCCGG CATCGAGGCC GGCGGATTGG TGGAGGCGCG CTTTGGGATC GCCTGCGCGC TCTATGAAAT GGGGCGGATA GACGATGCGT TTGATGCGAT CGATTCGATG GAGAAGATCG CCGAACAGCA ATCGGACGAG ATGCGCGCGC ACTGGAAAGA CTTGATCGCC GTGTTGCGTT TCGATCTCGT CGTGCAAAGC GAATTGCGCC GGGCTGCGGC GTTCGTCGAT CATGTGTATT GGCAATCTGC GCAGTCGATG AGCCGGGTGG ACCGCGCGCA CGGTGTGTCG GAGGCCGCCG TATCCGTCGA GACGCCGCTG CTGCGCGGCC GGGTGGCCTA TCTGCTGCAG TTGCGATGCG CGGCCGCGGG CAATCGGGAC GCCGTCGCCG AGTTGGCGCG TTGCCTCGAT GCGGCGGGCG AGCAGGGATT CGTCGACTTT CGATACACGC TGCGCCTCGA GATTGCGCTC GCCCTGCTCG CGGGCGACGC GCCCAATTTG GCGCAATTCG TGTTGGAGCC GATTTCCGAC ACATTGCATG GCGCAGAGTC GAGCCGCCGC TATCGGGAAT ATTTCTATTG CGCCGCGAAG GTGCATCTGG CGCAGGACCA CACGCAGGAA TCGCTGGCCT TATACCGACG CTACGCGCTG ATCGCGATGA GATGTCTGCG CGAGGACGCG CTGATCGGCA GGCAGTTCCT GGTCGGGCAG GAACTGAAGC AGCTTCCCCA GTCCGACGAT GTGACCGTGC GCTTGCCGTT GAAATATCGA CGCGCCTATC ACTATATTCT CCAGAATCTC AACCGTAGCG ACCTTTCGGT TCGGGAGATC GCGGCGGAGA TCGGCGTCAC GGAGCGCGCG CTGCAGAACG CATTCAAGAT CTACCTCGGG CTTTCCCCGC GTGAACTGAT CCGCTCGCGG AGAATGGAGC GTATCCGCAC GGAACTCGTC GATTTCACGT TGACGGGTGA GCGCAACGTC AAGGAGGCGG CCCGAAAATG GGGTGTCCAG AATGGTTCGA CACTCGTGAT CGCCTATCGG AAGGAGTACG ACGAAACCCC TTCGGAAACG CTCGCGCGCT GA
Protein sequence | MFSMLYFPMV SALSLLGADA PTHLHSHLKL ILGGEFNAAL ERSSEWAETT VASERTSWDL QLHADLQLVL GFEVEAEENY RRAQRKIRGS NSKIRIATCR NAAWQALFRY RVTTALACFS RICDEPGIEA GGLVEARFGI ACALYEMGRI DDAFDAIDSM EKIAEQQSDE MRAHWKDLIA VLRFDLVVQS ELRRAAAFVD HVYWQSAQSM SRVDRAHGVS EAAVSVETPL LRGRVAYLLQ LRCAAAGNRD AVAELARCLD AAGEQGFVDF RYTLRLEIAL ALLAGDAPNL AQFVLEPISD TLHGAESSRR YREYFYCAAK VHLAQDHTQE SLALYRRYAL IAMRCLREDA LIGRQFLVGQ ELKQLPQSDD VTVRLPLKYR RAYHYILQNL NRSDLSVREI AAEIGVTERA LQNAFKIYLG LSPRELIRSR RMERIRTELV DFTLTGERNV KEAARKWGVQ NGSTLVIAYR KEYDETPSET LAR