Gene B21_02630 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02630 
Symbolptr 
ID8115619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2793484 
End bp2796372 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content51% 
IMG OID644848827 
Producthypothetical protein 
Protein accessionYP_003000400 
Protein GI251786096 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGCA GCACCTGGTT CAAAGCATTA TTGTTGTTTG TTGCCCTTTG GGCACCCTTA 
AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT
AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT
CCGCAGGCGG TTAAATCGCT CTCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC
GAGGCGTACC AGGGGCTGGC ACATTACCTT GAACATATGA GTCTGATGGG GTCGAAAAAG
TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGTAG TCACAATGCC
AGCACTGCGC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCTGGT
GCGGTAGACC GCCTGGCCGA TGCTATTGCT GAACCTTTGC TCGACAAGAA ATATGCCGAA
CGTGAGCGTA ATGCGGTGAA CGCTGAATTA ACCATGGCGC GTACGCGTGA CGGGATGCGC
ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT
GGTAACCTCG AAACTTTAAG CGACAAACCT GGTAATCCGG TGCAGCAGGC GCTGAAAGAT
TTCCACGAGA AGTACTATTC CGCCAATTTG ATGAAGGCGG TTATTTACAG TAATAAACCG
CTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACCTTTGGTC GCGTGCCGAA CAAAGAGAGC
AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT
TACGTCCCGG CGCTGCCGCG TAAAGTGTTG CGCGTAGAGT TTCGTATTGA TAACAACTCG
GCGAAATTCC GTAGCAAAAC CGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA
GGTACACTTT CCGACTGGCT GCAAAAGCAG GGATTAGTTG AGGGCATTAG CGCCAACTCC
GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA
GGTCTGGCGA ATCGCGATCA GGTTGTGGCG GCTATTTTTA GCTACCTCAA TTTGTTACGT
GAAAAAGGGA TCGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC
CGTTATCCGT CAATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT
CGCGTTCCTG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGATGCTAAA
GCAGTAAAGG AACGTCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC
CCGAAAGAGC CGCACAACAA AACGGCTTAC TTTGTCGATG CGCCGTATCA GGTCGATAAA
ATCAGCGCAC AAACTTTCGC CGACTGGCAG AAAAAAGCCG CCGACATTGC GCTCTCTTTG
CCAGAGCTTA ACCCTTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA CAAGAAATAT
GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGCG TAGTGTATGC GCCAAGCCGT
TATTTTGCCA GCGAGCCAAA AGCCGATGTC AGCCTGATTT TGCGTAATCC GAAAGCCATG
GACAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT
GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA CGCTAACAAC
GGCCTTATGG TTAATGCTAA TGGTTACACC CAGCGTCTGC CGCAGCTGTT CCAGGCATTG
CTGGAAGGCT ACTTTAGCTA TACCGCCACG GAAGATCAGC TTGAGCAGGC GAAGTCCTGG
TATAACCAGA TGATGGATTC CGCAGAAAAG GGTAAAGCGT TTGAGCAGGC GATTATGCCC
GCGCAGATGC TCTCGCAAGT GCCGTACTTC TCGCGAGATG AACGGCGTAA AATTTTGCCC
TCCATTACGT TGAAAGAGGT GCTGGCCTAT CGCGACGCCT TAAAATCAGG GGCTCGACCA
GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCTGGC ACGCGATGTG
CAAAAACAGT TGGGCGCTGA TGGTTCAGAG TGGTGTCGAA ACAAAGATGT AGTGGTCGAT
AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG
GTATTTGTAC CGACTGGCTA CGATGAATAC ACCAGCTCAG CCTATAGCTC TCTGTTGGGG
CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAATT GGGCTATGCC
GTGTTTGCGT TTCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT TTTGCAAAGC
AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC AACCGCAGAG
GCAAAATTGC GAGCGATGAA GCCAGATGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC
CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT CGAAGTTAAG TAAAGATTTC
GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG
ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGCATGGCT
ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTACA CCCTGAAGGC
TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG
AATGAGTGA
 
Protein sequence
MPRSTWFKAL LLFVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP
LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS
AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK
GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI
RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK
ISAQTFADWQ KKAADIALSL PELNPYIPDD FSLIKSDKKY DHPELIVDES NLRVVYAPSR
YFASEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN
GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP
AQMLSQVPYF SRDERRKILP SITLKEVLAY RDALKSGARP EFMVIGNMTE AQATTLARDV
QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE
AKLRAMKPDE FAQIQQAVIT QMLQAPQTLG EEASKLSKDF DRGNMRFDSR DKIVAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK
NE