Gene B21_02903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02903 
SymbolygjO 
ID8116457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3100026 
End bp3101162 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID644849090 
Producthypothetical protein 
Protein accessionYP_003000663 
Protein GI251786359 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2813] 16S RNA G1207 methylase RsmC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.781708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACT TAGACAACGG TTTCCGTTCA CTGACACTAC AACGTTTTCC GGCGACGGAT 
GACGTTAACC CGCTACAGGC GTGGGAAGCG GCGGATGAAT ATTTGCTGCA ACAGTTGGAC
GACACAGAAA TCCGCGGCCC GGTGTTGATC CTGAATGATG CCTTTGGTGC GTTAAGCTGC
GCGCTGGCGG AACATAAGCC GTACAGCATT GGCGACTCAT ACATCAGTGA ACTGGCGACG
CGCGAGAATT TACGCCTCAA CGGGATTGAT GAGTCGAGCG TGAAGTTTCT CGACAGCACC
GCCGACTACC CGCAACAGCC GGGTGTGGTG CTGATCAAAG TGCCGAAAAC ACTGGCATTG
CTGGAACAGC AACTGCGTGC GCTGCGCAAA GTGGTCACGT CGGATACACG TATTATTGCC
GGTGCCAAGG CCCGTGACAT TCACACTTCC ACGCTGGAAC TGTTCGAAAA AGTGCTCGGC
CCAACCACCA CCACGCTGGC ATGGAAAAAA GCGCGCCTGA TTAATTGCAC TTTCAATGAA
CCGCAGCTGG CCGATGCGCC GCAGACCGTT AGCTGGAAGC TGGAAGGTAC TGACTGGACT
ATCCACAACC ATGCGAATGT CTTCTCCCGC ACCGGGCTTG ATATCGGCGC GCGCTTCTTT
ATGCAACATC TGCCAGAGAA TCTCGAAGGG GAGATTGTCG ATCTCGGTTG CGGTAATGGC
GTTATTGGTC TGACGCTGCT TGATAAAAAC CCGCAGGCGA AAGTGGTGTT TGTCGATGAA
TCGCCGATGG CGGTTGCTTC CAGCCGTTTG AACGTTGAAA CCAACATGCC AGAGGCGTTG
GATCGCTGCG AGTTTATGAT CAACAACGCG CTCTCCGGCG TGGAGCCTTT CCGCTTTAAT
GCTGTGCTCT GCAACCCGCC GTTTCACCAA CAACATGCGC TGACCGATAA CGTCGCCTGG
GAGATGTTCC ACCACGCCCG CCGCTGCCTG AAAATCAACG GCGAGCTGTA TATCGTTGCC
AACCGTCACC TGGATTACTT CCATAAACTG AAGAAGATTT TCGGCAACTG CACCACTATT
GCGACGAATA ATAAATTTGT GGTACTGAAA GCGGTGAAGC TGGGACGTCG TCGGTAA
 
Protein sequence
MSHLDNGFRS LTLQRFPATD DVNPLQAWEA ADEYLLQQLD DTEIRGPVLI LNDAFGALSC 
ALAEHKPYSI GDSYISELAT RENLRLNGID ESSVKFLDST ADYPQQPGVV LIKVPKTLAL
LEQQLRALRK VVTSDTRIIA GAKARDIHTS TLELFEKVLG PTTTTLAWKK ARLINCTFNE
PQLADAPQTV SWKLEGTDWT IHNHANVFSR TGLDIGARFF MQHLPENLEG EIVDLGCGNG
VIGLTLLDKN PQAKVVFVDE SPMAVASSRL NVETNMPEAL DRCEFMINNA LSGVEPFRFN
AVLCNPPFHQ QHALTDNVAW EMFHHARRCL KINGELYIVA NRHLDYFHKL KKIFGNCTTI
ATNNKFVVLK AVKLGRRR