Gene Information |
Locus tag | EcE24377A_3881 |
Symbol | yhgF |
ID | 5589564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3852134 |
End bp | 3854455 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640927503 |
Product | protein yhgF |
Protein accession | YP_001464864 |
Protein GI | 157157022 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
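The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal check, not part of the record itself: it assumes Start bp and End bp are 1-based and inclusive (which matches the reported 2322 bp), and that the final codon of this unspliced CDS is a stop codon that contributes no residue. The function names are illustrative only.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 3852134
END_BP = 3854455


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2322, matches "Gene Length"
print(protein_length(length_bp))  # 773, matches "Protein Length"
```

The same check does not apply unchanged to spliced genes or pseudogenes, where the coordinate span and the coding length can differ.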
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG AGCTATCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAATCCAT TTCCGAGCAA GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC GCCGCTGCAC AATATGTTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAT GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG CGTGATTATC TGTGGAAGAA CGCGCATCTG GTCTCAACCG TGGTGAGCGG TAAAGAAGAG GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTCCAGCT TTCGCTGAAT GCCGATCCGC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC TGGACCTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG TACCGTGCGC GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG GCGGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACCGGGGTA AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTGGCGA CCGATACCAT TTACCCGCAC ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCT TGTGTGAAAA ACATAACGTT GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CTGAACGTTT CTATCTCGAC GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG TCGGTTTACT CAGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG CGTGGTGCGG TATCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC CGCAAACTGG ATGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACC GCTTCTGTTC CGCTATTAAC CCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC GTGGCCTGGC GCGATGAGAA CGGCCAGTTC AAGAACCGTC AGCAACTGCT GAAAGTCAGC CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCC TGCGCATTAA CCACGGTGAT AACCCGCTGG ACGCCTCTAC CGTTCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG 
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGCGC TGTCACCAAC GTCACCAACT TTGGCGCGTT TGTCGATATT GGCGTGCATC AGGACGGCCT GGTTCACATC TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACCATGCGT CTGGACGAGC AGCCTGGCGA AACCAACGCC CGTCGCGGCG GCGGTAATGA ACGCCCGCAA AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAATAGCGCG ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
|
Protein sequence | MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF KNRQQLLKVS RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
|
| |