Gene EcE24377A_3881 details


Gene Information

Locus tag: EcE24377A_3881
Symbol: yhgF
ID: 5589564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3852134
End bp: 3854455
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 57%
IMG OID: 640927503
Product: protein yhgF
Protein accession: YP_001464864
Protein GI: 157157022
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
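
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3852134
end_bp = 3854455
gene_length_bp = 2322
protein_length_aa = 773


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 3854455 - 3852134 + 1 gives 2322 bp, and 2322 / 3 - 1 gives 773 aa, matching the listed gene and protein lengths.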


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG 
GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT
CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG
AGCTATCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAATCCAT TTCCGAGCAA
GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC
GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA
GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC
GCCGCTGCAC AATATGTTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAT
GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG
CGTGATTATC TGTGGAAGAA CGCGCATCTG GTCTCAACCG TGGTGAGCGG TAAAGAAGAG
GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT
CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTCCAGCT TTCGCTGAAT
GCCGATCCGC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT
CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC
TGGACCTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG TACCGTGCGC
GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG
GCGGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACCGGGGTA
AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTGGCGA CCGATACCAT TTACCCGCAC
ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCT TGTGTGAAAA ACATAACGTT
GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CTGAACGTTT CTATCTCGAC
GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG
TCGGTTTACT CAGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG
CGTGGTGCGG TATCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC
GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC
CGCAAACTGG ATGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACC
GCTTCTGTTC CGCTATTAAC CCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC
GTGGCCTGGC GCGATGAGAA CGGCCAGTTC AAGAACCGTC AGCAACTGCT GAAAGTCAGC
CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCC TGCGCATTAA CCACGGTGAT
AACCCGCTGG ACGCCTCTAC CGTTCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG
GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA
GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT
GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGCGC TGTCACCAAC
GTCACCAACT TTGGCGCGTT TGTCGATATT GGCGTGCATC AGGACGGCCT GGTTCACATC
TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT
GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACCATGCGT
CTGGACGAGC AGCCTGGCGA AACCAACGCC CGTCGCGGCG GCGGTAATGA ACGCCCGCAA
AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAATAGCGCG
ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
 
Protein sequence
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 
SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE
AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV
RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN
ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA
SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF KNRQQLLKVS
RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN
VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR
LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
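
As a rough illustration of how the gene sequence maps onto the protein sequence, the sketch below translates the first 30 and last 42 nucleotides of the gene sequence listed in this record. It builds the codon table from the amino-acid string that NCBI publishes for translation table 11 (the sense-codon assignments are the same as in the standard code). The helper name `translate` is hypothetical, and the sketch ignores table 11's alternative start codons, which does not matter here because this gene begins with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 42 nt of the gene sequence, copied from this record.
first_30 = "ATGATGAATG ATTCGTTCTG CCGCATTATT"
last_42 = "ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA"

print(translate(first_30))  # MMNDSFCRII
print(translate(last_42))   # MMDALAAAMGKKR*
```

The two outputs agree with the first ten and last thirteen residues of the protein sequence above, with the trailing `*` corresponding to the TAA stop codon.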