Gene B21_03211 details


Gene Information

Locus tag: B21_03211
Symbol: yhgF
ID: 8115688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3396976
End bp: 3399297
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 57%
IMG OID: 644849388
Product: hypothetical protein
Protein accession: YP_003000961
Protein GI: 251786657
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
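
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3396976
END_BP = 3399297
GENE_LENGTH_BP = 2322
PROTEIN_LENGTH_AA = 773


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)     # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```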


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG 
GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCGTTTAT CGCACGTTAT
CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG
AGCTATCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAGTCCAT TTCCGAGCAA
GGCAAACTCA CCGATGATCT GGCGAAGGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC
GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA
GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC
GCCGCTGCAC AATATGTTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAT
GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG
CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCTACGG TGGTGAGCGG TAAAGAAGAG
GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT
CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTCCAGCT TTCGCTGAAT
GCCGATCCGC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCATGGAT
CACCTTGGCC TACGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC
TGGACCTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG TACCGTGCGC
GAACGCGCGG AAGATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG
GCGGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACTGGGGTA
AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTAGCGA CCGATACCAT TTACCCGCAC
ACCGGACAAG CCGCAAAAGC AGCGATGACC GTTGCTGCCT TGTGTGAAAA ACATAACGTT
GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CTGAACGTTT CTATCTCGAC
GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG
TCGGTTTACT CAGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG
CGTGGCGCGG TGTCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC
GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC
CGCAAACTGG ACGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACT
GCTTCTGTTC CGCTGTTAAC CCGCGTGGCG GGCCTGACGC GCATGATGGC GCAAAACATC
GTGGCCTGGC GCGATGAGAA CGGCCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTCAGC
CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCT TGCGCATTAA CCACGGTGAT
AACCCGCTGG ATGCTTCTAC CGTCCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG
GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA
GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT
GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGTGC GGTGACCAAC
GTCACCAACT TTGGCGCGTT TGTCGATATC GGCGTGCATC AGGACGGCCT GGTTCACATC
TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT
GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACTATGCGT
CTGGACGTGC AGCCTGGCGA AACCAACGCC CGTCGCGGCG GCGGTAATGA ACGCCCGCAA
AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAATAGCGCG
ATGATGGATG CGCTGGCGGC GGCAATGGGC AAAAAACGTT AA
 
Protein sequence
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 
SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE
AGLEPLADLL WSDPSHTPEV AAAQYVDADK GVADTKAALD GARYILMERF AEDAALLAKV
RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN
ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA
SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS
RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN
VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR
LDVQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
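
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand and uses the standard amino-acid assignments, which translation table 11 shares with the standard code. It does not handle alternative start codons, which is sufficient here because the sequence begins with ATG. The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed above.
print(translate("ATGATGAATG ATTCGTTCTG CCGCATTATT"))  # MMNDSFCRII
```

Pasting the full gene sequence into `translate` should yield a protein that can be compared against the listed protein sequence after both are passed through `clean`.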