Gene B21_01965 details


Gene Information

Locus tag: B21_01965
Symbol: yegI
ID: 8114654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2047352
End bp: 2049247
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 52%
IMG OID: 644848179
Product: hypothetical protein
Protein accession: YP_002999752
Protein GI: 251785448
COG category: [R] General function prediction only
COG ID: [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains
TIGRFAM ID:
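
The coordinates and lengths in this record can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2047352
END_BP = 2049247


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1896 631
```

Under those assumptions the computed values agree with the listed Gene Length (1896 bp) and Protein Length (631 aa).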


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGCCGTGAAC TGGGCAAAGG CGGTGAAGGT GCGGTTTATG ATATCGAGGA GTTTGCCGAT 
AGCGTCGCCA AGATTTATCA CACGCCGCCA CCCGCCTTAA AACAGGACAA ACTTGCCTTT
ATGGCTGCGA CAGCTGACGC GCAGTTGTTG AATTATGTCG CCTGGCCGCA GGCAACGCTT
CACGGTGGGC GAGGCGGAAA AGTTATCGGT TTTATGATGC CAAAAGTTTC TGGTAAAGAA
CCGATTCATA TGATCTATAG CCCGGCACAT CGTCGCCAGA GTTACCCTCA TTGTGCGTGG
GATTTTCTAC TCTATGTTGC GCGCAATATT GCTTCATCTT TTGCTACGGT TCACGAGCAC
GGGCACGTCG TGGGGGACGT AAACCAGAAC AGCTTTATGG TAGGTCGCGA CAGCAAAGTG
GTGTTGATCG ATAGTGACTC CTTTCAGATT AACGCCAATG GCACACTGCA TTTATGCGAA
GTCGGCGTGT CGCATTTTAC GCCGCCAGAG CTACAAACCT TGCCATCATT TGTCGGTTTT
GAACGTACCG CGAATCACGA TAATTTTGGC CTTGCGTTGC TGATTTTTCA CGTCTTGTTT
GGTGGGCGGC ATCCTTATTC TGGTGTGCCG CTTATCTCTG ATGCGGGTAA TGCGCTGGAG
ACGGATATTG CCCATTTCCG TTATGCCTAC GCGTCAGATA ATCAGCGACG TGGTTTAAAA
CCGCCGCCAC GATCGATTCC GCTGTCGATG TTACCGGGCG ATGTTGAAGC CATGTTTCAG
CAGGCGTTTA CGGAAAGTGG TGTAGCAACC GGGCGTCCGA CGGCTAAAGC GTGGGTAGCA
GCACTGGATT CTCTACGCCA ACAGTTAAAG AAATGTACCG TTTCGGCAAT GCATGTTTAT
CCCGCTCATT TGACCGACTG CCCGTGGTGT ACGCTGGATA ATCAAGGCGT TATCTATTTT
ATTGATCTCG GCGAAGAGGT CATTACCACC GGCGGTGATT TTGTGCTGGC GAAAGTCTGG
GCGATGGTGA TGGCGTCAGT AGCACCGCCA GCAGTGCAAT TGCCATTACC CGATCATTTC
CAACCGACTG GCAGGCCGCT TCCTTTAGGC CTGTTACGGC GCGAATACAT CATTCTGCTT
GAGATCGCAC TGTCAGCGTT ATCGCTGTTG CTTTGCGGCC TTCAGGCAGA ACCGCGTTAT
ATTATTTTGG TTCCTGTGCT GGCGGCTATC TGGATTATTG GCAGTCTGAC AAGCAAAGCT
TATAAAGCAG AAATCCAGCA ACGCCGTGAG GCATTTAATC GCGCGAAAAT GGACTATGAC
CATTTAGTCA GCCAGATCCA ACAGTTGGGC GGGCTGGAAG GTTTTATCGC CAAACGGACG
ATGCTCGAAA AAATGAAGGA CGAAATGCTC GGGTTACCGG AGGAAGAAAA ACGTGCTCTG
GCAGCACTTC ACGACACCGC AAGGGAACGG CAGAAGCAGA AGTTTCTGGA GGGATTTTTT
ATTGATGTTG CCTCTATTCC CGGTGTTGGC CCTGCGCGTA AAGCGGCGTT ACGGTCCTTT
GGTATTGAAA CAGCAGCGGA TGTTACCCGT CGTGGGGTTA AGCAAGTTAA AGGGTTTGGT
GATCATCTGA CCCAGGCGGT CATCGACTGG AAAGCGAGCT GTGAACGCCG TTTTGTGTTC
AGGCCGAACG AAGCGGTAAC GCCTGCAGAC AGACAAGCGG TAATGGCGAA AGTGGCCGCC
AAACGACATC GGCTGGAATC GGCGTTGACT GTCGGCGCGA CAGAGTTGCA GCGATTCCGC
CTTCATGCTC CAGCACGGAC CATGCCGTTG ATGGAACCGT TACGTCAGGC GGCAGAAAAA
CTGGCTCAGG CGCAGGCAGA TTTAAGCCGC TGCTGA
 
Protein sequence
GRELGKGGEG AVYDIEEFAD SVAKIYHTPP PALKQDKLAF MAATADAQLL NYVAWPQATL 
HGGRGGKVIG FMMPKVSGKE PIHMIYSPAH RRQSYPHCAW DFLLYVARNI ASSFATVHEH
GHVVGDVNQN SFMVGRDSKV VLIDSDSFQI NANGTLHLCE VGVSHFTPPE LQTLPSFVGF
ERTANHDNFG LALLIFHVLF GGRHPYSGVP LISDAGNALE TDIAHFRYAY ASDNQRRGLK
PPPRSIPLSM LPGDVEAMFQ QAFTESGVAT GRPTAKAWVA ALDSLRQQLK KCTVSAMHVY
PAHLTDCPWC TLDNQGVIYF IDLGEEVITT GGDFVLAKVW AMVMASVAPP AVQLPLPDHF
QPTGRPLPLG LLRREYIILL EIALSALSLL LCGLQAEPRY IILVPVLAAI WIIGSLTSKA
YKAEIQQRRE AFNRAKMDYD HLVSQIQQLG GLEGFIAKRT MLEKMKDEML GLPEEEKRAL
AALHDTARER QKQKFLEGFF IDVASIPGVG PARKAALRSF GIETAADVTR RGVKQVKGFG
DHLTQAVIDW KASCERRFVF RPNEAVTPAD RQAVMAKVAA KRHRLESALT VGATELQRFR
LHAPARTMPL MEPLRQAAEK LAQAQADLSR C
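
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short fragments of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    head = "GGCCGTGAAC TGGGCAAAGG CGGTGAAGGT"
    print(translate(head))  # prints: GRELGKGGEG
    # Last 6 bases: a cysteine codon followed by the TGA stop codon.
    print(translate("TGCTGA"))  # prints: C
```

The first ten residues and the final residue produced this way match the start and end of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.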