Gene B21_03389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03389 
Symbolybl149 
ID8116248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3613469 
End bp3615472 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content54% 
IMG OID644849562 
Producthypothetical protein 
Protein accessionYP_003001135 
Protein GI251786831 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAGCC TTAAGGAGCA CAAGATGAAC ATTTCGGAAG TCGATCTGCG TAAACTGACG 
GTCAGCGATC CGTTCCTCGG TCAGTACCAA CAACTGGTCC GCGACGTGGT GATTTCTTAT
CAATGGGATG CCTTGAACGA TCGTATCCCA GAAGCGGAAC CCAGCCATGC GATTGAAAAC
TTTCGCATTG CTGCCGGACT TCAGGAGGGT GAATTTTACG GGATGGTGTT TCAGGACAGC
GACGTCGCCA AATGGCTGGA AGCGGTAGCC TGGTCGCTGT GCCAGAAGCC GGACGCCGAA
CTGGAAAAAA CCGCCGACGA GGTAATCGAA CTGATCGCCT CCGCCCAATG TGAAGACGGC
TATCTCAATA CTTACTTTAC GGTAAAAGCA CCCGAAGAAC GCTGGAGCAA TCTTGCGGAG
TGTCATGAAC TTTACTGCGC CGGTCATCTG ATTGAAGCCG GAGTCGCCTT CTTCCAGGCC
ACGGGAAAAC GACGCTTGCT GGAGGTGGTT TGCCGTCTGG CCGATCATAT CGACCGCGTA
TTTGGTCCAG ATGAAAGTAA GTTACACGGT TATCCTGGTC ACCCGGAAAT TGAACTGGCA
CTAATGCGCC TGTATGAAGT GACTGAAGAG CCGCGCTACC TGGCGCTGAC GAACTATTTT
GTCGAACAGC GTGGTGCGCA ACCGCACTAT TACGACCAAG AATATGAAAA GCGCGGGCAG
ACATCGCACT GGCACACCTA CGGCCCGGCG TGGATGGTGA AAGACAAAGC CTACAGCCAG
GCACATTTGT CCCTTGCGCA ACAGCAAACC GCCATCGGTC ACGCGGTACG TTTTGTCTAC
CTGATGACCG GCGTCGCGCA TCTCGCGCGT TTAAGTCACG ATGACAGCAA GCGTCAGGAC
TGCCTGAGGC TGTGGAACAA TATGGCCCAG CGTCAGTTAT ATATTACCGG CGGCATTGGC
TCGCAAAGCA GCGGCGAAGC GTTCACTAGC GATTACGATC TGCCGAATGA CACGGTTTAC
GCCGAAAGTT GTGCTTCCAT CGGCCTGATG ATGTTCGCCC GGCGAATGCT GGAAATGGAA
GGCGACAGTC AATATGCCGA TGTGATGGAG CGCGCGCTGT ACAACACCGT GCTCGGCGGC
ATGGCGCTGG ATGGCAAACA TTTCTTCTAT GTGAATCCGC TGGAAGTACA TCCAAAATCG
CTGAAATTCA ACCATATCTA CGATCACGTT AAACCGATCC GCCAGCGTTG GTTTGGCTGC
GCTTGTTGTC CGCCAAATAT CGCCCGCGTG CTGACCTCGA TTGGTCATTA TCTCTACACG
CCGCGTGAAG ATGCGTTGTA TATCAACATA TACGCAGGAA ACAGCATGGA AGTGCCGGTA
GAAAATGGCA CGCTGCGCCT GCGGGTTAGC GGGAACTATC CGTGGCAGGA GCAGGTGACG
ATTGCGGTTG AATCGCCCCA GCCGGTACGT CATACGCTGG CTTTACGTCT GCCGGACTGG
TGCACACAGC CGCAGATCAT ATTGAATGGG GAAGAGGTCG AGCAGGATAT TCGTAAAGGG
TATTTGCACA TTACCCGCGA ATGGCAGGAG GGCGATACGC TGAATCTGAC TTTGCCGATG
CCGGTACGCC GCGTTTACGG TAACCCGCTG GTGCGTCACG TCGCCGGAAA AGTGGCGATT
CAGCGCGGCC CGCTGGTGTA TTGCCTGGAA CAGGCCGACA ACGGCGAGTC ACTGCATAAT
CTGTGGCTGC CCACCGATGC GCCATTTACG ACATTTGAAG GCAAGGGATT GTTTAGCCAT
AAGATCTTAA TCCAGGCACC GGGTTACCGG TATGAACAGA GCAATCCAGA GCAGCAACCG
CTGTGGCATT ACGACAGCGC GCCAGCCAAA CGCCAGCCGC AAACTCTGAC GTTTATCCCG
TGGTTTAGCT GGGCTAACCG GGGCGAAGGC GAAATGCGGA TCTGGGTGAA TGAGGAAAAG
CATCGCCATC CGGAGGTTGG ATAA
 
Protein sequence
MYSLKEHKMN ISEVDLRKLT VSDPFLGQYQ QLVRDVVISY QWDALNDRIP EAEPSHAIEN 
FRIAAGLQEG EFYGMVFQDS DVAKWLEAVA WSLCQKPDAE LEKTADEVIE LIASAQCEDG
YLNTYFTVKA PEERWSNLAE CHELYCAGHL IEAGVAFFQA TGKRRLLEVV CRLADHIDRV
FGPDESKLHG YPGHPEIELA LMRLYEVTEE PRYLALTNYF VEQRGAQPHY YDQEYEKRGQ
TSHWHTYGPA WMVKDKAYSQ AHLSLAQQQT AIGHAVRFVY LMTGVAHLAR LSHDDSKRQD
CLRLWNNMAQ RQLYITGGIG SQSSGEAFTS DYDLPNDTVY AESCASIGLM MFARRMLEME
GDSQYADVME RALYNTVLGG MALDGKHFFY VNPLEVHPKS LKFNHIYDHV KPIRQRWFGC
ACCPPNIARV LTSIGHYLYT PREDALYINI YAGNSMEVPV ENGTLRLRVS GNYPWQEQVT
IAVESPQPVR HTLALRLPDW CTQPQIILNG EEVEQDIRKG YLHITREWQE GDTLNLTLPM
PVRRVYGNPL VRHVAGKVAI QRGPLVYCLE QADNGESLHN LWLPTDAPFT TFEGKGLFSH
KILIQAPGYR YEQSNPEQQP LWHYDSAPAK RQPQTLTFIP WFSWANRGEG EMRIWVNEEK
HRHPEVG