Gene Information |
Locus tag | EcolC_0337 |
Symbol | |
ID | 6065661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 388909 |
End bp | 389994 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641599736 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001723342 |
Protein GI | 170018388 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.467081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGT TTCCTCTGCA AAGCCTGACG CTTATTGAGG CGCAGCAAAA GCAGTTTGCG CTGGTGGACA CGATTTGCCG TCACTTTCCC GGCGCGGAGT TTCTAACCAG CGGTGATTTG GGCTTAACGC CGGGGCTGAA TCAACCGCGT ATTACCCAAC GGGTGGAGCA GGTGCTGGCT GATGCATTTC ACGCACAGGC TGCGGCGCTG GTGCAGGGCG CGGGGACTGG CGCGATTCGC GCCGGGCTGG CGGCTTTGCT CAAACCGGGG CAGCGTCTTC TGGTGCATGA CGCGCCTGTT TACCCGACGA CACGGGTTAT TATTGAGCAG ATGGGGCTGA CGCTTATTAC TGTTGATTTC AATGACCTGT CGGCACTGAA GCAGGTCGTC GACGAGCAAC AACCGGATGC GGCGCTGGTG CAGCATACGC GCCAGCAGCC GCAGGACAGC TACGTGCTGG CAGATGTGCT GGCAACGTTG CGCGCGGCAG GTGTTCCAGC GTTAACCGAT GACAACTATG CGGTGATGAA GGTGGCGCGA ATCGGCTGTG AATGCGGCGC GAATGTCTCG ACATTTTCCT GCTTCAAGCT ATTTGGGCCA GAGGGTGTTG GTGCAGTGGT CGGCGATGCT GATGTTATCA ACCGTATTCG CGCCACGCTT TACTCCGGCG GTAGCCAGAT CCAGGGCGCA CAGGCGCTGG AAGTATTGCG TGGTCTGGTG TTTGCGCCAG TGATGCACGC GGTGCAGGCA GGGGTATCTG AACGGTTGCT GGCTTTGCTT AACGGTGGTG CGGTGCCGGA AGTGAAAAGC GCGGTGATTG CTAATGCGCA GTCGAAGGTG TTGATTGTCG AGTTTCATCA GCCGATTGCC GCCAGAGTGC TGGAAGAGGC GCAAAAGCGC GGTGCCTTGC CTTACCCGGT GGGTGCAGAG TCGAAATATG AAATCCCGCC GCTCGTTTAT CGCCTTTCCG GAACGTTTCG CCAGGCGAAT CCACAATCAG AACATTGTGC GATTCGCATT AACCCGAATC GCAGCGGTGA AGAGACGGTG CTGCGGATTT TGCGTGAGAG TATTGCCAGT ATTTAA
|
Protein sequence | MKTFPLQSLT LIEAQQKQFA LVDTICRHFP GAEFLTSGDL GLTPGLNQPR ITQRVEQVLA DAFHAQAAAL VQGAGTGAIR AGLAALLKPG QRLLVHDAPV YPTTRVIIEQ MGLTLITVDF NDLSALKQVV DEQQPDAALV QHTRQQPQDS YVLADVLATL RAAGVPALTD DNYAVMKVAR IGCECGANVS TFSCFKLFGP EGVGAVVGDA DVINRIRATL YSGGSQIQGA QALEVLRGLV FAPVMHAVQA GVSERLLALL NGGAVPEVKS AVIANAQSKV LIVEFHQPIA ARVLEEAQKR GALPYPVGAE SKYEIPPLVY RLSGTFRQAN PQSEHCAIRI NPNRSGEETV LRILRESIAS I
|
| |
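The coordinate, length, and GC fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA, which is consistent with this). The function names are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in cleaned)
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Values copied from the Gene Information table above.
    length = gene_length(388909, 389994)
    print(length)                           # 1086
    print(expected_protein_length(length))  # 361
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content; the spaces between 10-base blocks are removed by `strip_blocks` before counting.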