Gene B21_03056 details

Gene Information

Locus tag: B21_03056
Symbol: yhdP
ID: 8112813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3252822
End bp: 3256622
Gene Length: 3801 bp
Protein Length: 1266 aa
Translation table: 11
GC content: 54%
IMG OID: 644849240
Product: hypothetical protein
Protein accession: YP_003000813
Protein GI: 251786509
COG category: [S] Function unknown
COG ID: [COG3164] Predicted membrane protein
TIGRFAM ID: [TIGR02099] conserved hypothetical protein TIGR02099
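
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3252822, 3256622)
print(length_bp)                  # 3801
print(protein_length(length_bp))  # 1266
```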


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.102838
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCTGCC 
CTGCTGGTTA GCGGCCTGCG TATTGCTTTA CCGCATCTTG ACGCCTGGCG TCCGGAAATC
CTCAACAAAA TAGAATCCGC GACTGGCATG CCGGTAGAAG CCAGTCAGCT CTCAGCCAGC
TGGCAGAATT TTGGCCCGAC GCTTGAAGCA CACGACATCC GTGCAGAACT AAAAGATGGC
GGCGAATTTT CGGTTAAACG CGTTACTCTG GCGCTGGATG TCTGGCAGAG CCTGTTACAT
ATGCGCTGGC AGTTTCGCGA CCTCACTTTC TGGCAGCTGC GCTTTCGCAC CAACACTCCT
ATCACCAGCG GTGGTAGTGA TGACAGTCTG GAAGCCAGTC ACATCAGCGA TCTGTTTCTT
CGTCAATTTG ACCATTTCGA TCTTCGCGAC AGTGAAGTCA GTTTCCTGAC GCCATCCGGT
CAGCGCGCCG AGCTGGCGAT CCCACAACTC ACCTGGCTGA ACGATCCACG TCGACACCGT
GCGGAAGGCC TGGTAAGCCT CTCCAGCCTT ACCGGACAGC ACGGCGTGAT GCAGGTGCGC
ATGGATTTGC GCGATGATGA GGGGTTGTTA AGCAATGGTC GCGTCTGGCT CCAGGCGGAT
GACATCGACC TGAAGCCGTG GCTCGGTAAA TGGATGCAGG ACAATATTGC GCTGGAAACG
GCACAGTTCT CCCTTGAAGG CTGGATGACG ATCGACAAAG GCGATGTAAC CGGCGGTGAC
GTCTGGCTGA AACAGGGCGG TGCCAGCTGG TTGGGCGAGA AGCAAACGCA TACGCTGTCG
GTGGATAATC TGACCGCGCA TATTACGCGT GAAAATCCGG GCTGGCAGTT CTCTATTCCC
GATACACGGA TCACGATGGA CGGCAAACCC TGGCCGAGCG GAGCATTGAC GCTGGCCTGG
ATACCGGAAC AGGACGTTGG CGGCAAAGAC AATAAACGCA GTGACGAACT CCGGATTCGC
GCCAGTAATC TGGAGCTGGC AGGCCTGGAG GGCATACGCC CGCTGGCCGC GAAACTTTCA
CCTGCACTGG GTGATGTTTG GCGCTCCACA CAACCGAGCG GCAAGATTAA CACTCTGGCG
CTGGATATCC CGCTTCAGGC GGCAGACAAG ACCCGTTTTC AGGCATCGTG GAGCGATCTG
GCCTGGAAGC AATGGAAATT ATTACCGGGT GCGGAACACT TCTCCGGGAC GCTTTCCGGC
AGCGTTGAAA ATGGTTTGCT TACCGCGTCG ATGAAGCAGG CAAAGATGCC TTACGAAACG
GTATTCCGTG CGCCACTGGA AATCGCCGAC GGCCAGGCAA CTATAAGCTG GCTGAACAAT
GACAAAGGTT TCCAGCTGGA TGGGAGTGAT ATTGACGTTA AAGCCAAAGC CGTCCATGCG
CGCGGCGGTT TTCGTTACCT GCAACCTGCT AACGATGAAC CCTGGCTGGG TATTCTGGCT
GGCATCAGTA CCGATGATGG TTCACAAGCC TGGCGCTATT TCCCGGAAAA CTTGATGGGT
AAAGACCTGG TTGATTACTT AAGTGGCGCG ATTCAGGGCG GTGAAGCGGA TAACGCGATG
CTGGTTTATG GTGGCAATCC GCAACTCTTC CCCTATAAAC ACAACGAAGG TCAGTTTGAA
GTGCTGGTGC CGCTGCGCAA CGCGAAGTTT GCCTTCCAGC CGGACTGGCC TGCATTAACT
AACCTTGATA TTGAACTGGA CTTTATTAAC GACGGTTTAT GGATGAAAAC CGATGGCGTT
AATCTGGGCG GCGTGCGCGC GAGTAATCTC ACCGCAGTGA TCCCTGACTA CTCGAAAGAA
AAACTGCTGA TTGACGCTGA CATTAAAGGT CCGGGTAAAG CCGTTGGCCC TTACTTTGAT
GAGACACCGC TGAAAGATTC TCTGGGTGCG ACCCTGCAAG AACTCCAGCT CGACGGCGAT
GTGAATGCTC GCTTACATCT TGATATCCCG CTGAACGGCG AGCTGGTAAC CGCGAAAGGT
GAAGTGACGC TGCGTAATAA CAGTCTGTTT ATCAAACCAC TCGCCAGCAC CCTGAAAAAT
TTGAGCGGTA AATTCAGCTT TATCAATGGC GATCTGCAAA GTGAACCACT GACAGCAAGC
TGGTTTAATC AGCCGTTGAA CGTGGATTTT TCCACCAAAG AAGGGGCAAA AGCCTACCAG
GTAGCGGTAA ACCTCAACGG TAACTGGCAA CCGGCGAAAA CCGGCGTTCT GCCTGCAGCG
GTGAACGAAG CATTGAGTGG CAGCGTGGCG TGGGATGGTA AAGTGGGCAT TGTTCTGCCT
TATCATGCTG GCGCGACGTA TAACGTAGAG CTAAACGGCG ATTTGAAGAA TGTGAGCAGT
CACTTACCTT CACCGTTAGC CAAACCTGCG GGTGAACCAC TGCCGGTAAA CGTTAAGGTT
GATGGCAATC TCAACAGCTT TGATTTAACC GGACAGGCTG GTGCGGATAA CCATTTCAAT
AGCCGCTGGT TGCTCGGTCA AAAGCTGACG CTCGACCGTG CTATTTGGGC GGCAGACAGT
AAAACGCTCC CGCCGTTGCC GGAACAAAGT GGTGTTGAAC TCAATATGCC GCCGATGAAT
GGTGCCGAGT GGCTGGCCCT GTTTCAGAAA GGTGCGGCGG AGAGTGTCGG TGGTGCAGCG
AGTTTCCCAC AACACATAAC GTTACGTACG CCTATGTTGT CGCTGGGAAA TCAGCAATGG
AATAACCTGA GTATTGTTTC GCAACCGACG GCAAATGGCA CCCAGGTTGA GGCGCAAGGG
CGTGAAATCA ACGCCACGCT GGCGATGCGT AATAACGCGC CGTGGCTGGC GAATATCAAA
TATCTTTATT ACAACCCGAG CGTGGCGAAA ACTCGTGGTG ATTCAACACC GTCATCACCT
TTCCCGACAA CGGAACGCAT TAACTTCCGT GGCTGGCCGG ACGCCCAAAT ACGATGCGCA
GAGTGCTGGT TCTGGGGGCA AAAATTCGGG CGCATTGACA GTGATATCAC TATTTCTGGC
AATACATTAA CGCTGACCAA TGGACTGATT GATACTGGTT TCTCGCGGCT TACTGCCGAT
GGTGAATGGG TTAATAATCC GGGGAATGAA CGTACCTCGC TGAAAGGAAA ACTGCGCGGG
CAGAAAATTG ATGCCGCCGC AGAATTTTTT GGTGTCACGA CGCCCATACG GCAGTCGTCA
TTTAATGTGG ATTACGATTT ACACTGGCGT AAAGCACCGT GGCAGCCAGA TGAGGCGACG
TTGAATGGCA TCATTCATAC TCAACTGGGT AAAGGCGAAA TTACCGAAAT CAATACCGGA
CATGCCGGGC AATTGCTGCG CTTATTGAGC GTAGATGCCC TGATGCGTAA GCTGCGTTTT
GATTTCAGAG ACACTTTTGG CGAAGGGTTC TATTTTGACT CCATTCGCAG CACCGCGTGG
ATTAAAGACG GCGTTATGCA CACCGACGAC ACGCTGGTGG ATGGCCTGGA GGCGGATATC
GCCATGAAAG GGTCGGTAAA TCTGGTACGT CGCGACCTGA ATATGGAAGC GGTTGTCGCA
CCAGAGATTT CTGCGACGGT GGGCGTGGCT GCGGCTTTTG CGGTTAACCC CATTGTTGGC
GCGGCAGTGT TTGCCGCCAG TAAAGTGCTG GGGCCGCTGT GGAGCAAAGT CTCCATTTTG
CGCTATCACA TTTCGGGTCC GCTGGACGAT CCGCAAATCA ACGAAGTGTT GCGCCAACCG
CGTAAAGAAA AAGCGCAATG A
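
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first line of the sequence as sample input, and the helper names are hypothetical. How the GC content value in the record was computed or rounded is not stated, so this is only an illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above, used here as sample input.
first_line = "GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCTGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full 3801 bp sequence to `clean_sequence` and `gc_fraction` gives a value to compare with the GC content listed in the record.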
 
Protein sequence
MRRLPGILLL TGAALVVIAA LLVSGLRIAL PHLDAWRPEI LNKIESATGM PVEASQLSAS 
WQNFGPTLEA HDIRAELKDG GEFSVKRVTL ALDVWQSLLH MRWQFRDLTF WQLRFRTNTP
ITSGGSDDSL EASHISDLFL RQFDHFDLRD SEVSFLTPSG QRAELAIPQL TWLNDPRRHR
AEGLVSLSSL TGQHGVMQVR MDLRDDEGLL SNGRVWLQAD DIDLKPWLGK WMQDNIALET
AQFSLEGWMT IDKGDVTGGD VWLKQGGASW LGEKQTHTLS VDNLTAHITR ENPGWQFSIP
DTRITMDGKP WPSGALTLAW IPEQDVGGKD NKRSDELRIR ASNLELAGLE GIRPLAAKLS
PALGDVWRST QPSGKINTLA LDIPLQAADK TRFQASWSDL AWKQWKLLPG AEHFSGTLSG
SVENGLLTAS MKQAKMPYET VFRAPLEIAD GQATISWLNN DKGFQLDGSD IDVKAKAVHA
RGGFRYLQPA NDEPWLGILA GISTDDGSQA WRYFPENLMG KDLVDYLSGA IQGGEADNAM
LVYGGNPQLF PYKHNEGQFE VLVPLRNAKF AFQPDWPALT NLDIELDFIN DGLWMKTDGV
NLGGVRASNL TAVIPDYSKE KLLIDADIKG PGKAVGPYFD ETPLKDSLGA TLQELQLDGD
VNARLHLDIP LNGELVTAKG EVTLRNNSLF IKPLASTLKN LSGKFSFING DLQSEPLTAS
WFNQPLNVDF STKEGAKAYQ VAVNLNGNWQ PAKTGVLPAA VNEALSGSVA WDGKVGIVLP
YHAGATYNVE LNGDLKNVSS HLPSPLAKPA GEPLPVNVKV DGNLNSFDLT GQAGADNHFN
SRWLLGQKLT LDRAIWAADS KTLPPLPEQS GVELNMPPMN GAEWLALFQK GAAESVGGAA
SFPQHITLRT PMLSLGNQQW NNLSIVSQPT ANGTQVEAQG REINATLAMR NNAPWLANIK
YLYYNPSVAK TRGDSTPSSP FPTTERINFR GWPDAQIRCA ECWFWGQKFG RIDSDITISG
NTLTLTNGLI DTGFSRLTAD GEWVNNPGNE RTSLKGKLRG QKIDAAAEFF GVTTPIRQSS
FNVDYDLHWR KAPWQPDEAT LNGIIHTQLG KGEITEINTG HAGQLLRLLS VDALMRKLRF
DFRDTFGEGF YFDSIRSTAW IKDGVMHTDD TLVDGLEADI AMKGSVNLVR RDLNMEAVVA
PEISATVGVA AAFAVNPIVG AAVFAASKVL GPLWSKVSIL RYHISGPLDD PQINEVLRQP
RKEKAQ
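
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids as the standard genetic code, and that the initiator codon (GTG for this gene) is read as methionine. The function and its `starts_at_initiator` parameter are illustrative names. Only the first and last lines of the gene sequence are used as sample input.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(grouped_dna: str, starts_at_initiator: bool = True) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    # Assumption: an alternative start codon such as GTG is read as Met.
    if starts_at_initiator and residues:
        residues[0] = "M"
    return "".join(residues)


first_line = "GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCTGCC"
last_line = "CGTAAAGAAA AAGCGCAATG A"
print(translate_cds(first_line))                            # MRRLPGILLLTGAALVVIAA
print(translate_cds(last_line, starts_at_initiator=False))  # RKEKAQ
```

Both outputs match the start and the end of the protein sequence listed above.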