Gene B21_00395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00395 
Symbollon 
ID8112943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp427823 
End bp430177 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content52% 
IMG OID644846679 
Producthypothetical protein 
Protein accessionYP_002998252 
Protein GI251783948 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCTG AGCGTTCTGA ACGCATTGAA ATCCCCGTAT TGCCGCTGCG CGATGTGGTG 
GTTTATCCGC ACATGGTCAT CCCCTTATTT GTCGGGCGGG AAAAATCTAT CCGTTGTCTG
GAAGCGGCGA TGGACCATGA TAAAAAAATT ATGCTGGTCG CGCAGAAAGA AGCTTCAACG
GATGAGCCGG GTGTAAACGA TCTTTTCACC GTCGGGACCG TGGCCTCTAT ATTGCAGATG
CTGAAACTGC CTGACGGCAC CGTCAAAGTG CTGGTCGAGG GGTTACAGCG CGCGCGTATT
TCTGCGCTCT CTGACAATGG CGAACACTTT TCTGCGAAGG CGGAGTATCT GGAGTCGCCG
ACCATTGATG AGCGGGAACA GGAAGTGCTG GTGCGTACTG CAATCAGCCA GTTCGAAGGC
TACATCAAGC TGAACAAAAA AATCCCACCA GAAGTGCTGA CGTCGCTGAA TAGCATCGAC
GATCCGGCGC GTCTGGCGGA TACCATTGCT GCACATATGC CGCTGAAACT GGCTGACAAA
CAGTCTGTTC TGGAGATGTC CGACGTTAAC GAACGTCTGG AATATCTGAT GGCAATGATG
GAATCGGAAA TCGATCTGCT GCAGGTTGAG AAACGCATTC GCAACCGCGT TAAAAAGCAG
ATGGAGAAAT CCCAGCGTGA GTACTATCTG AACGAGCAAA TGAAAGCTAT TCAGAAAGAA
CTCGGTGAAA TGGACGACGC GCCGGACGAA AACGAAGCCC TGAAGCGCAA AATCGACGCG
GCGAAGATGC CGAAAGAGGC AAAAGAGAAA GCGGAAGCAG AGTTGCAGAA ACTGAAAATG
ATGTCTCCGA TGTCGGCAGA AGCGACCGTA GTGCGTGGTT ATATCGACTG GATGGTACAG
GTGCCGTGGA ATGCGCGTAG CAAGGTCAAA AAAGACCTGC GTCAGGCGCA GGAAATCCTT
GATACCGACC ATTATGGTCT GGAGCGCGTG AAAGATCGAA TCCTTGAGTA TCTTGCGGTT
CAAAGCCGTG TCAACAAAAT CAAGGGACCG ATCCTCTGCC TGGTAGGGCC GCCGGGGGTA
GGTAAAACCT CTCTTGGTCA GTCCATTGCC AAAGCCACCG GGCGTAAATA TGTCCGTATG
GCGCTGGGCG GCGTGCGTGA TGAAGCGGAA ATCCGTGGTC ACCGCCGTAC TTACATCGGT
TCTATGCCGG GTAAACTGAT CCAGAAAATG GCGAAAGTGG GCGTGAAAAA CCCGCTGTTC
CTGCTCGATG AGATCGACAA AATGTCTTCT GACATGCGTG GCGATCCGGC CTCTGCACTG
CTTGAAGTGC TGGATCCAGA GCAGAACGTA GCGTTCAGCG ACCACTACCT GGAAGTGGAT
TACGATCTCA GCGACGTGAT GTTTGTCGCG ACGTCGAACT CCATGAACAT TCCGGCACCG
CTGCTGGATC GTATGGAAGT GATTCGCCTC TCCGGTTATA CCGAAGATGA AAAACTGAAC
ATCGCCAAAC GTCACCTGCT GCCGAAGCAG ATTGAACGTA ATGCACTGAA AAAAGGTGAG
CTGACCGTCG ACGATAGCGC CATTATCGGC ATTATTCGTT ACTACACCCG TGAGGCGGGC
GTGCGTGGTC TGGAGCGTGA AATCTCCAAA CTGTGTCGCA AAGCGGTTAA GCAGTTACTG
CTCGATAAGT CATTAAAACA TATCGAAATT AACGGCGATA ACCTGCATGA CTATCTCGGT
GTTCAGCGTT TCGACTATGG TCGCGCGGAT AACGAAAACC GTGTCGGTCA GGTAACCGGT
CTGGCGTGGA CGGAAGTGGG CGGTGACTTG CTGACCATTG AAACCGCATG TGTTCCGGGT
AAAGGCAAAC TGACCTATAC CGGTTCGCTC GGCGAAGTGA TGCAGGAGTC CATTCAGGCG
GCGTTAACGG TGGTTCGTGC GCGTGCGGAA AAACTGGGGA TCAACCCTGA TTTTTACGAA
AAACGCGACA TCCACGTCCA CGTACCGGAA GGTGCGACGC CGAAAGATGG TCCGAGTGCC
GGTATTGCTA TGTGCACCGC GCTGGTTTCT TGCCTGACCG GTAACCCGGT TCGTGCCGAT
GTGGCAATGA CCGGTGAGAT CACTCTGCGT GGTCAGGTAC TGCCGATCGG TGGTTTGAAA
GAAAAACTCC TGGCAGCGCA TCGCGGCGGG ATTAAAACAG TGTTAATTCC GTTCGAAAAT
AAACGCGATC TGGAAGAGAT TCCTGACAAC GTAATTGCCG ATCTGGACAT TCATCCTGTG
AAGCGCATTG AGGAAGTTCT GACTCTGGCG CTGCAAAATG AACCGTCTGG CATGCAGGTT
GTGACTGCAA AATAG
 
Protein sequence
MNPERSERIE IPVLPLRDVV VYPHMVIPLF VGREKSIRCL EAAMDHDKKI MLVAQKEAST 
DEPGVNDLFT VGTVASILQM LKLPDGTVKV LVEGLQRARI SALSDNGEHF SAKAEYLESP
TIDEREQEVL VRTAISQFEG YIKLNKKIPP EVLTSLNSID DPARLADTIA AHMPLKLADK
QSVLEMSDVN ERLEYLMAMM ESEIDLLQVE KRIRNRVKKQ MEKSQREYYL NEQMKAIQKE
LGEMDDAPDE NEALKRKIDA AKMPKEAKEK AEAELQKLKM MSPMSAEATV VRGYIDWMVQ
VPWNARSKVK KDLRQAQEIL DTDHYGLERV KDRILEYLAV QSRVNKIKGP ILCLVGPPGV
GKTSLGQSIA KATGRKYVRM ALGGVRDEAE IRGHRRTYIG SMPGKLIQKM AKVGVKNPLF
LLDEIDKMSS DMRGDPASAL LEVLDPEQNV AFSDHYLEVD YDLSDVMFVA TSNSMNIPAP
LLDRMEVIRL SGYTEDEKLN IAKRHLLPKQ IERNALKKGE LTVDDSAIIG IIRYYTREAG
VRGLEREISK LCRKAVKQLL LDKSLKHIEI NGDNLHDYLG VQRFDYGRAD NENRVGQVTG
LAWTEVGGDL LTIETACVPG KGKLTYTGSL GEVMQESIQA ALTVVRARAE KLGINPDFYE
KRDIHVHVPE GATPKDGPSA GIAMCTALVS CLTGNPVRAD VAMTGEITLR GQVLPIGGLK
EKLLAAHRGG IKTVLIPFEN KRDLEEIPDN VIADLDIHPV KRIEEVLTLA LQNEPSGMQV
VTAK