Gene B21_03861 details


Gene Information

Locus tag: B21_03861
Symbol: yjbH
ID: 8115738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4143095
End bp: 4145191
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 54%
IMG OID: 644850017
Product: hypothetical protein
Protein accession: YP_003001590
Protein GI: 251787286
COG category:
COG ID:
TIGRFAM ID:
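
The start, end, and length values listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information table.
gene_length = span_length(4143095, 4145191)
print(gene_length)                           # 2097
print(expected_protein_length(gene_length))  # 698
```

Both results agree with the Gene Length (2097 bp) and Protein Length (698 aa) fields.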


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GACATCTGCT TAGCTTACTG GCGCTGGGCA TTAGCATGGC TTGCTACGGC 
GAAACATATC CTGCGCCCAT TGGCCCGTCG CAGTCGGATT TCGGTGGCGT AGGATTATTA
CAAACGCCCA CCGCACGCAT GGCGCGGGAA GGGGAGTTGA GCCTGAACTA TCGCGATAAC
GATCAGTACC GTTATTACTC AGCTTCAGTG CAACTCTTCC CGTGGCTGGA AACAACGCTG
CGCTACACCG ACGTGCGCAC CCGGCAGTAC AGCAGCGTCG AAGCGTTCTC TGGCGATCAA
ACGTATAAAG ATAAAGCCTT CGATCTCAAA CTGCGTTTGT GGGAAGAGAG TTACTGGCTG
CCGCAAGTGG CGGTTGGTGC GCGGGATATC GGCGGTACGG GGCTGTTTGA TGCGGAATAT
CTTGTTGCCA GCAAAGCCTG GGGGCCGTTC GATTTTACGC TCGGCCTGGG CTGGGGATAT
TTGGGCACCA GCGGTAATGT GAAAAATCCG CTCTGTTCAG CCAGTGATAA ATATTGCTAT
CGCGATAACA GCTACAAACA GGCGGGATCT ATCGACGGTA GCCAGATGTT CCACGGTCCT
GCCTCACTGT TTGGCGGCGT GGAATACCAG ACGCCCTGGC AACCGCTGCG CCTGAAACTG
GAGTATGAAG GCAATAATTA TCAGCAGGAT TTTGCCGGGA AGCTGGAGCA AAAAAGTAAG
TTTAACGTCG GTGCGATTTA TCGCGTTACC GATTGGGCCG ACGTTAACCT TAGCTATGAA
CGTGGCAACA CCTTTATGTT TGGCGTTACG TTGCGCACCA ACTTTAACGA TCTGCGCCCG
TCTTACAACG ATAACGCCCG CCCGCAATAT CAACCGCAGC CGCAGGATGC CATTTTGCAG
CATTCGGTGG TGGCGAATCA GTTAACGCTG TTGAAATACA ATGCTGGACT TGCCGATCCA
CAGATCCAGG CGAAAGGCGA TACGCTGTAT GTTACCGGCG AGCAGGTGAA ATATCGTGAT
TCGCGCGAAG GGATCATCCG TGCGAATCGG ATCGTGATGA ACGATCTGCC GGATGGGATC
AAAACGATCC GCATTACGGA AAATCGCCTT AACATGCCGC AGGCGACGAC GGAAACCGAT
GTCGCCAGCC TGAAAAATCA TCTCGCCGGA GAGCCGTTGG GCCACGAAAC GACGCTGGCG
CAAAAACGCG TCGAGCCAGT GGTTCCGCAG TCCACCGAGC AGGGCTGGTA TATCGACAAA
TCACGCTTTG ATTTCCATAT CGATCCGGTG CTGAACCAGT CGGTCGGTGG CCCGGAAAAC
TTTTACATGT ATCAGCTGGG CGTGATGGGA ACGGCAGATT TGTGGCTGAC GGACCATCTG
CTGACCACCG GCAGCCTGTT TGCAAATCTT GCCAACAACT ACGACAAGTT TAACTACACT
AATCCTCCGC AGGACTCGCA CTTACCGCGC GTGCGTACCC ATGTGCGCGA GTATGTGCAG
AACGATGTCT ATGTGAATAA CCTGCAAGCC AACTACTTCC AGCATCTGGG CAACGGCTTC
TACGGTCAGG TCTACGGTGG TTATCTCGAA ACCATGTTTG GCGGTGCGGG GGCAGAAGTG
TTGTATCGCC CGCTGGACAG CAACTGGGCG TTTGGTCTGG ATGCCAACTA CGTTAAACAG
CGCGACTGGC GTAGTGCAAA AGATATGATG AAATTCACCG ACTACAGCGT GAAAACCGGA
CATCTGACCG CCTACTGGAC GCCATCTTTC GCTCAGGATG TGTTAGTTAA AGCCAGCGTC
GGGCAGTATC TGGCAGGGGA TAAAGGCGGC ACGCTGGAGA TCGCCAAACG CTTTGATAGC
GGCGTGGTGG TGGGTGGCTA TGCCACGATC ACTAATGTTT CGAAAGAGGA GTACGGCGAA
GGGGACTTCA CCAAAGGCGT GTATGTCTCG GTACCGTTGG ATCTCTTCTC GTCTGGCCCG
ACACGCAGCC GTGCGGCGAT TGGCTGGACG CCGCTGACGC GTGACGGTGG TCAGCAACTT
GGGCGTAAGT TCCAGTTGTA TGACATGACC AGCGACCGTA GCGTCAATTT CCGCTAA
 
Protein sequence
MKKRHLLSLL ALGISMACYG ETYPAPIGPS QSDFGGVGLL QTPTARMARE GELSLNYRDN 
DQYRYYSASV QLFPWLETTL RYTDVRTRQY SSVEAFSGDQ TYKDKAFDLK LRLWEESYWL
PQVAVGARDI GGTGLFDAEY LVASKAWGPF DFTLGLGWGY LGTSGNVKNP LCSASDKYCY
RDNSYKQAGS IDGSQMFHGP ASLFGGVEYQ TPWQPLRLKL EYEGNNYQQD FAGKLEQKSK
FNVGAIYRVT DWADVNLSYE RGNTFMFGVT LRTNFNDLRP SYNDNARPQY QPQPQDAILQ
HSVVANQLTL LKYNAGLADP QIQAKGDTLY VTGEQVKYRD SREGIIRANR IVMNDLPDGI
KTIRITENRL NMPQATTETD VASLKNHLAG EPLGHETTLA QKRVEPVVPQ STEQGWYIDK
SRFDFHIDPV LNQSVGGPEN FYMYQLGVMG TADLWLTDHL LTTGSLFANL ANNYDKFNYT
NPPQDSHLPR VRTHVREYVQ NDVYVNNLQA NYFQHLGNGF YGQVYGGYLE TMFGGAGAEV
LYRPLDSNWA FGLDANYVKQ RDWRSAKDMM KFTDYSVKTG HLTAYWTPSF AQDVLVKASV
GQYLAGDKGG TLEIAKRFDS GVVVGGYATI TNVSKEEYGE GDFTKGVYVS VPLDLFSSGP
TRSRAAIGWT PLTRDGGQQL GRKFQLYDMT SDRSVNFR
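
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The helpers `clean`, `translate`, and `gc_percent` are hypothetical names, not part of any database tool, and the 10-base spacing of the listing is simply stripped before processing.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as listed above.
first_line = "ATGAAAAAAA GACATCTGCT TAGCTTACTG GCGCTGGGCA TTAGCATGGC TTGCTACGGC"
print(translate(first_line))  # MKKRHLLSLLALGISMACYG
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.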