Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pmen_2543 |
Symbol | |
ID | 5108010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas mendocina ymp |
Kingdom | Bacteria |
Replicon accession | NC_009439 |
Strand | + |
Start bp | 2807720 |
End bp | 2808730 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640503787 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001188031 |
Protein GI | 146307566 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.354483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000426703 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAGCT TCACCCTACC CGCGCAGTAC CTGCGGCAGA TCGTCGGCCT GGTCGGCGAC ATGGGCGCCG ACGTGCCGGG CTGGCTGGCG CGGGCGCAAC TGCACGAGGC GCGACTGGGC GACAGCGACC TGCACCTGTC GCTGGCCACC TTCGAGCGCT TGATCGAGGA CGCCATGGCG CGTACCGACG AGCCGGCCTT CGGCCTGCTG GTGGGCGAGC GCCTGGTGGT CAATACCCAC GGCCTGCTGG GCTATGCGGC GATGAACAGC ACCACCCTGC GCCAGGCCGC CCAGCTGATC GAGCGCTACA TCGCCCTGCG CACCAGCCTG GTGAGCATCC GCCTGGTGGA GACGGAGGGG GAGGCGCGCC TGGTCTTCGC CGAGGCGGCG CCGCTGGGCG GCATCGCCCG GCCGGTGCTG GAGGCGGTGA TCCTGGCGAT CAAGAACGTG CTGGACTTCA TGACCCTGGG CAGTTGCCCG CTGGAGCGGG TCAGCTTCGC CTTCGCCAGG CCCGCCTACG CCGGACTGGC CCATCAGCTG TGCCGCTGCG AGGTGCGCTA CGACGCCGAC TGGAGCGGCT TCGTCCTGCC CGCGCAGAGC ATCGACCAGC CGCTGAAGAC CGCCGACGCC GCCAGCTTTC GTGATACCGA ACAGATACTC CAGCGCGAAC TGGCCAAGCT CACCGCCGAG CAGTCCATGA GCAGCCGGGT ACGCCGGGTG CTGCTGGAAA AACAGGGTGG CTTTCCCTCG CTCAGCCTTA CCGCCCGCCT CTTCCACCTG ACCCCGCGCA CCCTGCACCG CCGCCTGCAG GCCGAAGGCA CCTCGTTCAA GAGCCTGCTC GAGGAGGTGC GCCACACCCT GGCCCAGGAG CATCTCAAGG CCGGGCGCAT GACGGTGGAA GAGATCGCCT ACAGCCTGGG CTACACCGAC CTGGCCAACT TCCGCCGCGC CTTCAAGCGT TGGCAGCGCC AGTCGCCCTC GGCCTATCGC GCCGAACAGC AGGCGCTGTA G
|
Protein sequence | MQSFTLPAQY LRQIVGLVGD MGADVPGWLA RAQLHEARLG DSDLHLSLAT FERLIEDAMA RTDEPAFGLL VGERLVVNTH GLLGYAAMNS TTLRQAAQLI ERYIALRTSL VSIRLVETEG EARLVFAEAA PLGGIARPVL EAVILAIKNV LDFMTLGSCP LERVSFAFAR PAYAGLAHQL CRCEVRYDAD WSGFVLPAQS IDQPLKTADA ASFRDTEQIL QRELAKLTAE QSMSSRVRRV LLEKQGGFPS LSLTARLFHL TPRTLHRRLQ AEGTSFKSLL EEVRHTLAQE HLKAGRMTVE EIAYSLGYTD LANFRRAFKR WQRQSPSAYR AEQQAL
|
| |