Gene B21_03712 details


Gene Information

Locus tag: B21_03712
Symbol: yihQ
ID: 8114944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3972013
End bp: 3973977
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 54%
IMG OID: 644849873
Product: hypothetical protein
Protein accession: YP_003001446
Protein GI: 251787142
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
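
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names span_length and expected_protein_length are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3972013
END_BP = 3973977
GENE_LENGTH_BP = 1965
PROTEIN_LENGTH_AA = 654


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Protein length for a CDS that ends in one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = span_length(START_BP, END_BP)
    print("span length:", length, "matches record:", length == GENE_LENGTH_BP)
    protein = expected_protein_length(length)
    print("protein length:", protein, "matches record:", protein == PROTEIN_LENGTH_AA)
```

Under those assumptions the span is 1965 bp and the expected protein length is 654 aa, in agreement with the Gene Length and Protein Length fields.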


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACAACGTC TTATTTTAAC CCATAGCAAA GATAATCCTT GTTTATGGAT TGGCTCAGGT 
ATAGCGGATA TCGATATGTT CCGCGGTAAT TTCAGCATTA AAGATAAACT ACAGGAGAAA
ATTGCGCTTA CCGACGCCAT CGTCAGCCAG TCACCGGATG GTTGGTTAAT TCATTTCAGC
CGTGGTTCTG ACATTAGCGC CACGCTGAAT ATCTCTGCCG ACGATCAGGG GCGTTTATTG
CTGGAACTAC AAAACGACAA CCTTAACCAC AACCGTATCT GGCTGCGCCT TGCCGCTCAA
CCAGAGGACC ATATCTACGG CTGCGGCGAA CAGTTTTCCT ACTTCGATCT GCGTGGCAAA
CCGTTCCCGC TATGGACCAG TGAACAAGGC GTTGGTCGCA ACAAACAAAC CTATGTCACC
TGGCAGGCCG ACTGCAAAGA AAATGCGGGC GGCGACTATT ACTGGACTTT CTTCCCACAG
CCTACGTTTG TCAGCACGCA GAAGTATTAC TGCCATGTTG ATAACAGTTG CTATATGAAC
TTCGACTTTA GTGCCCCGGA ATACCATGAA CTGGCGCTGT GGGAAGACAA AGCAACGCTG
CGTTTTGAAT GTGCTGACAC ATACATTTCC CTGCTGGAAA AATTAACCGC CCTGCTGGGA
CGCCAGCCAG AACTGCCCGA CTGGATTTAT GACGGAGTAA CGCTCGGCAT TCAGGGCGGG
ACGGAAGTGT GCCAGAAGAA ACTGGACACC ATGCGTAACG CGGGCGTGAA GGTCAACGGC
ATCTGGGCGC AGGACTGGTC CGGTATTCGT ATGACCTCTT TTGGCAAACG CGTGATGTGG
AACTGGAAGT GGAACAGCGA AAACTACCCG CAACTGGATT CACGCATTAA GCAGTGGAAT
CAGGAGGGCG TGCAGTTCCT GGCCTATATC AACCCGTATG TTGCCAGCGA TAAAGATCTC
TGCGAAGAAG CGGCACAACA CGGCTATCTG GCAAAAGATG CCTCTGGCGG TGACTATCTG
GTGGAGTTTG GCGAGTTTTA CGGCGGCGTT GTCGATCTCA CTAATCCAGA AGCCTACGCC
TGGTTCAAGG AAGTGATCAA AAAGAACATG ATTGAACTCG GCTGCGGCGG CTGGATGGCT
GACTTCGGCG AGTATCTGCC CACCGACACG TACCTGCATA ACGGCGTCAG TGCCGAAATT
ATGCATAACG CCTGGCCTGC GCTGTGGGCG AAGTGTAACT ACGAAGCCCT TGAAGAAACG
GGCAAGCTCA GCGAGATCCT TTTCTTTATG CGCGCCGGTT CTACCGGTAG CCAGAAATAC
TCCACCATGA TGTGGGCGGG CGACCAGAAC GTCGACTGGA GTCTCGACGA TGGCCTGGCG
TCGGTTGTCC CGGCGGCGCT GTCGCTGGCA ATGACCGGAC ATGGCCTGCA CCACAGCGAC
ATTGGCGGTT ACACCACCCT GTTTGAGATG AAGCGCAGCA AAGAGCTGCT GCTGCGCTGG
TGCGATTTCA GCGCCTTCAC GCCGATGATG CGCACCCACG AAGGTAACCG TCCTGGCGAC
AACTGGCAGT TTGACGGCGA CGCAGAAACC ATCGCCCATT TCGCCCGTAT GACCACCGTC
TTCACCACCC TGAAACCTTA CCTGAAAGAG GCCGTCGCGC TGAATGCGAA GTCCGGCCTG
CCGGTTATGC GCCCGCTGTT CCTGCATTAC GAAGACGATG CGCACACTTA CACCCTGAAA
TATCAGTACC TGTTAGGTCG CGACATTCTG GTCGCTCCGG TGCATGAAGA AGGCCGTAGC
GACTGGACGC TCTATCTGCC GGAGGATAAC TGGGTCCACG CCTGGACGGG TGAAGCGTTC
CGGGGCGGGG AAGTTACCGT TAATGCGCCC ATCGGCAAGC CGCCGGTCTT TTATCGCGCC
GATAGCGAAT GGGCGGCACT GTTCGCGTCG TTAAAAAGCA TCTAA
 
Protein sequence
QQRLILTHSK DNPCLWIGSG IADIDMFRGN FSIKDKLQEK IALTDAIVSQ SPDGWLIHFS 
RGSDISATLN ISADDQGRLL LELQNDNLNH NRIWLRLAAQ PEDHIYGCGE QFSYFDLRGK
PFPLWTSEQG VGRNKQTYVT WQADCKENAG GDYYWTFFPQ PTFVSTQKYY CHVDNSCYMN
FDFSAPEYHE LALWEDKATL RFECADTYIS LLEKLTALLG RQPELPDWIY DGVTLGIQGG
TEVCQKKLDT MRNAGVKVNG IWAQDWSGIR MTSFGKRVMW NWKWNSENYP QLDSRIKQWN
QEGVQFLAYI NPYVASDKDL CEEAAQHGYL AKDASGGDYL VEFGEFYGGV VDLTNPEAYA
WFKEVIKKNM IELGCGGWMA DFGEYLPTDT YLHNGVSAEI MHNAWPALWA KCNYEALEET
GKLSEILFFM RAGSTGSQKY STMMWAGDQN VDWSLDDGLA SVVPAALSLA MTGHGLHHSD
IGGYTTLFEM KRSKELLLRW CDFSAFTPMM RTHEGNRPGD NWQFDGDAET IAHFARMTTV
FTTLKPYLKE AVALNAKSGL PVMRPLFLHY EDDAHTYTLK YQYLLGRDIL VAPVHEEGRS
DWTLYLPEDN WVHAWTGEAF RGGEVTVNAP IGKPPVFYRA DSEWAALFAS LKSI
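
The sequences above are shown in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be processed, the code below strips the block spacing, computes GC content, and translates codon by codon. The function names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in the start codons it permits); the translation here is literal, so the first codon is read by its ordinary meaning rather than as an initiator methionine, which is how the protein sequence in this record is displayed.

```python
# Standard codon-to-amino-acid assignments, in TCAG order.
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Literal codon-by-codon translation, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "CAACAACGTC TTATTTTAAC CCATAGCAAA"
    print(translate(first_blocks))  # QQRLILTHSK
    print(round(gc_content(first_blocks), 2))
```

Pasting the full gene sequence into the same functions would allow the GC content and Protein sequence fields to be compared against the record; that comparison is left to the reader and is not asserted here.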