Gene B21_04100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04100 
SymbolintB 
ID8115794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4406434 
End bp4407624 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content46% 
IMG OID644850247 
Producthypothetical protein 
Protein accessionYP_003001820 
Protein GI251787516 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTGC TTGTCCATCC AAATGGTTCT AAGTACTGGC GTTTGCAGTA CCGTTATGAG 
GGAAAGCAAA AAATGCTGGC ACTTGGGGTT TATCCTGAAA TCACACTAGC GGATGCCAGA
GTACGTCGTG ACGAGACGCG TAAGCTGCTT GCGAATGGCG TCGATCCGGG AGACAAAAAG
AAAAATGATA AGGTTGAACA GAGTAAAGCA CGAACCTTTA AAGAAGTCGC GATTGAGTGG
CATGGCACCA ATAAAAAGTG GTCTGAAGAT CACGCCCATC GTGTGCTAAA AAGTCTTGAA
GATAATCTTT TTGCAGCGCT TGGTGAACGT AATATCGCTG AGTTAAAAAC TCGAGATTTA
TTAGCACCTA TTAAGGCCGT AGAAATGTCT GGACGTCTTG AAGTGGCCGC TCGTCTTCAG
CAGCGCACTA CAGCCATCAT GCGCTATGCA GTGCAAAGTG GGTTAATTGA TTATAACCCG
GCACAAGAGA TGGCTGGGGC GGTTGCTTCC TGTAATCGAC AACATCGTCC CGCGCTTGAA
TTAAAGCGCA TCCCTGAGTT GCTTACAAAA ATAGATAGCT ATACTGGTAG GCCGCTAACC
CGATGGGCGA TAGAACTCAC TTTGCTGATC TTTATTCGGT CCAGTGAGCT GCGTTTTGCT
CGTTGGTCAG AGATCGATTT CGAAGCGTCT ATATGGACTA TCCCACCGGA GCGGGAGCCT
ATTCCTGGAG TGAAACATTC CCATAGAGGC TCAAAAATGC GTACAACGCA TCTAGTGCCT
CTTTCAACGC AAGCTCTTGC AATTTTAAAG CAGATAAAAC AGTTTTATGG GGCCCATGAC
TTGATATTTA TTGGTGATCA CGATTCGCAC AAACCCATGA GTGAGAATAC GGTAAATAGT
GCGTTACGGG TCATGGGGTA TGATACAAAA GTAGAGGTTT GTGGTCATGG CTTTCGAACA
ATGGCCTGTA GTTCATTGGT CGAATCAGGT CTGTGGTCTC GTGATGCTGT TGAACGTCAG
ATGAGCCACA TGGCGCGAAA TTCAGTGAGG GCCGCGTATA TCCATAAAGC AGAGCATCTG
GAAGAACGGC GATTGATGCT ACAGTGGTGG GCCGATTTTC TGGATGTAAA CAGAGAAAGG
TTTATCAGTC CATTTGAATA TGCAAAGATT AATAATCCAT TAAAACAGTA A
 
Protein sequence
MHLLVHPNGS KYWRLQYRYE GKQKMLALGV YPEITLADAR VRRDETRKLL ANGVDPGDKK 
KNDKVEQSKA RTFKEVAIEW HGTNKKWSED HAHRVLKSLE DNLFAALGER NIAELKTRDL
LAPIKAVEMS GRLEVAARLQ QRTTAIMRYA VQSGLIDYNP AQEMAGAVAS CNRQHRPALE
LKRIPELLTK IDSYTGRPLT RWAIELTLLI FIRSSELRFA RWSEIDFEAS IWTIPPEREP
IPGVKHSHRG SKMRTTHLVP LSTQALAILK QIKQFYGAHD LIFIGDHDSH KPMSENTVNS
ALRVMGYDTK VEVCGHGFRT MACSSLVESG LWSRDAVERQ MSHMARNSVR AAYIHKAEHL
EERRLMLQWW ADFLDVNRER FISPFEYAKI NNPLKQ