Gene B21_00851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00851 
Symbolybl47 
ID8115118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp887859 
End bp889481 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content51% 
IMG OID644847115 
Producthypothetical protein 
Protein accessionYP_002998688 
Protein GI251784384 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAA AATTTTATAC CCTGCTGACG GATATTGGCG CGGCGAAACT TGCCAGCGCC 
GCCGCGCTCG GTGTGCCTTT AAAAATTACC CATATGGCGG TCGGCGATGG CGGCGGAGTA
TTGCCAACGC CGGATGCAAA GCAGACTGCA CTGGTGAATG AGAAACGCCG GGCTGCGCTG
AATATGCTCT ATATCGACCC GCAGAACAGT AGCCAGATTA TTGCTGAACA GGTAATCCCT
GAAAATGAGG GCGGTTGGTG GATACGTGAA GTGGGCCTGT TTGATGAGTC CGGGGCATTG
ATTGCCGTGG GAAACTGCCC GGAAAGCTAT AAGCCGCAAC TGGCTGAAGG CAGTGGGCGT
ACCCAGACCG TGCGCATGGT GCTGATTACC AGCAGCACGG ACAATATCAC CCTGAAAATC
GACCCTGCCG TCGTGCTGGC AACCCGCAAG TATGTGGATG ACAAGGTACT GGAGCTGAAG
GTGTTCGTGG ATGATAAGAT GGCAAAACAT CTTGCCGCAC CGGACCCGCA TTCACAGTAT
GCACCCAAAG AAAGCCCGAC ATTGACCGGA ACACCCAAAG CGCCAACGCC AGCGGAGGGG
AATAACACCA CGCAGATTGC GACCACCGCG TTTGTTCAGG CGGCACTGAT GGCCCTTATT
AATGGTGCGC CAGCCACACT GGATACGATG AAAGAAATTG CCGCTGCCAT TAATAATGAC
CCGAAATTCA GTACCACCAT TAACAATGCG CTGGCACTGA AAGCGCCGCT GTTAAGTCCG
GCATTCACCG GAACGCCAAC AGCCCCCACT GCCGCACAGT CGGTTAACAA TACACAGATT
GCCACCACGG CTTTTGTGAA ATCGGCAATT GCGGCAATGG TGGGGTCTGC ACCTGCTGCA
CTGGATACAC TGAACGAACT GGCTGCGGCG CTGGGGAATG ACCCGAACTT TGCCACGACA
ATGCTTAATG CGCTGGCAGG TAAACAACCG TTGGACAATA CGCTGACTAA TTTGAGTGGA
AAGGATGTAG CTGGTCTTCT CGCATACCTT GGTTTGGGAG ATGCATTAAT TGGTGATGAA
TGTAAAATTG CAGGGTTTGA CAGTAGTAAC GTCAATGCCC CGTATATGCG ATTCGCCAGA
ACAAATACAG TAGTTCGTCT GGCAACAAAA GACTATGCGC AACCAAAAGA CCAGACACTG
ACAGATTTAA GCGGTAAGGA TAAGGCTGAA CTAAGAACTT ATCTTGATCT GAAAAGTGCG
GCTCAAAGGG ATGTTGGCTC AGGGGCAAAT CAGATTCCGG ATATGAATGA CTTCACATCC
AGCCTGACCA GCCCTGGCTG GCAAAAATTA CCGTCAGGTC TGATTATTCA GTGGGGGGCA
GCCAATCCAT CATCAACTGG AGAGATCTTT ATTACGTTTC CTGTCGCGTT CTCTGCATAC
CCGATGTATG TGGGATTTGG TCCTCAGCAG GCATCGCTTC CTAACGTAGT TCAGTCGCCA
GTAATTTCAG CGCCAACGAT AACTAATTTA GGATGCGGCG TCCGAAATCT GATGATTCCA
ACAGCGGGCG GAGCACCAGT AGCCAGCATG AGTTCATTTT TCTGGATTGC GGTAGGGAAA
TAA
 
Protein sequence
MSTKFYTLLT DIGAAKLASA AALGVPLKIT HMAVGDGGGV LPTPDAKQTA LVNEKRRAAL 
NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDESGAL IAVGNCPESY KPQLAEGSGR
TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKVLELK VFVDDKMAKH LAAPDPHSQY
APKESPTLTG TPKAPTPAEG NNTTQIATTA FVQAALMALI NGAPATLDTM KEIAAAINND
PKFSTTINNA LALKAPLLSP AFTGTPTAPT AAQSVNNTQI ATTAFVKSAI AAMVGSAPAA
LDTLNELAAA LGNDPNFATT MLNALAGKQP LDNTLTNLSG KDVAGLLAYL GLGDALIGDE
CKIAGFDSSN VNAPYMRFAR TNTVVRLATK DYAQPKDQTL TDLSGKDKAE LRTYLDLKSA
AQRDVGSGAN QIPDMNDFTS SLTSPGWQKL PSGLIIQWGA ANPSSTGEIF ITFPVAFSAY
PMYVGFGPQQ ASLPNVVQSP VISAPTITNL GCGVRNLMIP TAGGAPVASM SSFFWIAVGK