Gene B21_02473 details


Gene Information

Locus tag: B21_02473
Symbol: ybl113
ID: 8116776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2621732
End bp: 2622961
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 46%
IMG OID: 644848673
Product: hypothetical protein
Protein accession: YP_003000246
Protein GI: 251785942
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
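
The start and end coordinates, gene length, and protein length listed in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2621732
end_bp = 2622961

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1230 409
```

Both results agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.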


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.36664
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAAA TCGCTAAGAA GCTCACTGAC ACTGAAATCA AAAGCACCAA ACCTGCCGAA 
AAAGAGGTTA ACCTTTTTGA CGGCGATGGT TTGCTGTTGC GAATCGCTCC CCTGGCGAAG
GGAGGGAAGA AAAATTGGTA TTTCAGATAT GCAGTGCCTG TGACCAAAAA GCGAACTAAG
GTGAGCTTAG GAACCTATCC TCACCTTACA CTTGCGAAGG CACGAGCTTT ACGTGATGAA
TACTTGTCGT TGCTTACAAA TGGTATAGAC CCCCAAGTTC ATAACAACCA AAAAGCCAAT
GCACTGAAAG ATGCCACGGA ACATACATTT CAAGCAGTAG CCAAGAAGTG GCTTGATGAG
AAAGTCAAAA CGTCAGGCAT CTCCCAAGAT CATGCTAACG ACATCTGGCG AAGCCTAGAG
AGAAATATCT TTTCCACATT GGGTGATACC CCAATTAAGG AGATTCGCCC TAAAATGCTT
AAACAGCATT TAGAACCCAT AGAAAAACGA GGTGTCCTTG AAACACTTCG CCGCATCATA
TCCCGCCTGA ATGAAATTTT CCGCTATGCA GCAACAGAAG AACTCATAGA ATTCAACCCG
GCAGACAACC TGGGGCAACG GTTCAGCAAG CCAAAAAAAC AGAATATGCC AGCATTACCC
CCTTCCGAAC TCCCTCGCTT CTTGGTTGCT CTAAACAATG CTTCTATCCG TTTGGAAACA
AGGCTACTGA TTGAGTGGCA ACTTCTCACA TGGGTTCGCC CAGGTGAAGC TGTTCGCACA
AGATGGTCAG ATATTGATAT TGAAACTGGC ATGTGGAACA TCCCGGCGGA GTTTATGAAA
ATGAAGAAGC CTCACAAAGT TCCACTGAGC AAAGAAGCTT TGCGAGTTTT GGATTTAATG
AAAGTCATCA GCGGGCATAG AGAGTGGGTG TTCCCCAGTA TCAAAGCTCC ACTCAATCAC
ATGCATGAAC AAACAGCTAA TGCGGCCATA ATCCGTATGG GTTTCGGAGG TGAGCTTGTA
GCTCACGGTA TGCGATCCAT TGCTAGAACG GCTGCTGAGG AGTCTGGCAA GTTTAGGACT
GATGTCTTAG AAGCCGCCCT TGCCCACTCG AAGAAAGATG AAATAATTGC AGCCTACAAT
CGTGCAGAGT ATCTCACTGA ACGGGTGGTT CTCATGCAAT GGTGGAGTGA CTATGTTTCG
TCTCAAAAAT GCAAAGTTAT TGCCGCATAA
 
Protein sequence
MAKIAKKLTD TEIKSTKPAE KEVNLFDGDG LLLRIAPLAK GGKKNWYFRY AVPVTKKRTK 
VSLGTYPHLT LAKARALRDE YLSLLTNGID PQVHNNQKAN ALKDATEHTF QAVAKKWLDE
KVKTSGISQD HANDIWRSLE RNIFSTLGDT PIKEIRPKML KQHLEPIEKR GVLETLRRII
SRLNEIFRYA ATEELIEFNP ADNLGQRFSK PKKQNMPALP PSELPRFLVA LNNASIRLET
RLLIEWQLLT WVRPGEAVRT RWSDIDIETG MWNIPAEFMK MKKPHKVPLS KEALRVLDLM
KVISGHREWV FPSIKAPLNH MHEQTANAAI IRMGFGGELV AHGMRSIART AAEESGKFRT
DVLEAALAHS KKDEIIAAYN RAEYLTERVV LMQWWSDYVS SQKCKVIAA
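
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The function name `translate` and the two test fragments, taken from the first and last lines of the gene sequence, are illustrative choices.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and the final 30 bases (both in frame).
first_block = "ATGGCCAAAA TCGCTAAGAA GCTCACTGAC"
last_block = "TCTCAAAAAT GCAAAGTTAT TGCCGCATAA"

print(translate(first_block))  # prints: MAKIAKKLTD
print(translate(last_block))   # prints: SQKCKVIAA
```

The two outputs match the beginning and end of the protein sequence listed in this record. Passing the full gene sequence to `translate` would allow the whole protein to be compared in the same way.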