Gene B21_03870 details


Gene Information

Locus tag: B21_03870
Symbol: yjbI
ID: 8113660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4156416
End bp: 4157744
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 33%
IMG OID: 644850026
Product: hypothetical protein
Protein accession: YP_003001599
Protein GI: 251787295
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
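
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names span_length and expected_protein_length are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields above.
START_BP = 4156416
END_BP = 4157744
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates give 1329 bp, and 1329 bp corresponds to 443 codons, that is 442 residues plus the stop codon, which agrees with the listed lengths.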


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.662274
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TTGAATGCGC TTGCAATTTT CTGATGGATA AAGATGCGCA GGGGTATATC 
GACCTGTCTG ATTTGGATTT AACAAGTTGT CATTTTAAAG GTGACGTTAT ATCGAAGGTG
TCTTTTTTAT CATCAAATCT ACAACATGTA ACATTCGAAT GTAAAGAAAT TGGGGATTGC
AATTTTACTA CTGCAATAGT TGATAATGTC ATATTTAGAT GTCGACGTTT ACACAATGTG
ATTTTTATCA AAGCGAGTGG TGAATGTGTC GATTTCAGCA AAAATATTCT TGATACAGTT
GACTTCTCGC AGAGTCAACT TGGTCATAGT AATTTTCGCG AATGTCAGAT TAGAAATTCA
AACTTCGATA ATTGTTATCT TTACGCTTCG CACTTCACCA GAGCAGAGTT TCTGTCTGCC
AAAGAAATAT CATTTATTAA ATCGAATTTG ACAGCTGTTA TGTTTGATTA TGTGCGAATG
TCGACAGGGA ATTTTAAAGA TTGCATTACA GAACAATTGG AATTAACTAT TGATTATTCA
GATATATTTT GGAATGAAGA TCTCGATGGT TATATCAATA ACATTATAAA AATGATTGAT
ACATTGCCAG ATAATGCAAT GATATTGAAA TCCGTTCTGG CCGTAAAACT GGTGATGCAA
TTAAAAATAC TTAATATTGT TAATAAAAAC TTTATTGAGA ATATGAAGAA AATATTTAGC
CATTGTCCTT ATATAAAAGA TCCCATTATA CGCAGTTATA TCCATTCTGA TGAAGATAAC
AAGTTCGATG ATTTTATGCG TCAACATCGA TTCAGTGAGG TGAATTTCGA TACCCAACAG
ATGATCGATT TTATTAACAG ATTTAATACG AATAAATGGC TAATTGATAA AAATAACAAT
TTTTTTATCC AACTTATCGA TCAGGCCTTA CGATCAACGG ATGATATGAT CAAAGCAAAT
GTTTGGCATC TTTATAAAGA GTGGATTCGT AGTGATGATG TTTCACCTAT ATTTATAGAA
ACTGAAGATA ATTTAAGAAC CTTTAACACG AATGAATTAA CACGAAACGA TAATATCTTT
ATCCTGTTCT CCTCAGTCGA TGATGGGCCA GTTATGGTGG TAAGCTCCCA GCGCTTACAT
GATATGTTGA ATCCTACAAA AGATACCAAT TGGAATTCCA CGTATATCTA CAAATCCAGA
CATGAGATGT TGCCTGTTAA TCTTACTCAG GAAACACTTT TCAGCTCCAA ATCTCATGGT
AAATATGCGC TTTTCCCCAT TTTTACTGCG AGTTGGCGAG CTCATCGTAT AATGAATAAG
GGTGTTTAA
 
Protein sequence
MKKIECACNF LMDKDAQGYI DLSDLDLTSC HFKGDVISKV SFLSSNLQHV TFECKEIGDC 
NFTTAIVDNV IFRCRRLHNV IFIKASGECV DFSKNILDTV DFSQSQLGHS NFRECQIRNS
NFDNCYLYAS HFTRAEFLSA KEISFIKSNL TAVMFDYVRM STGNFKDCIT EQLELTIDYS
DIFWNEDLDG YINNIIKMID TLPDNAMILK SVLAVKLVMQ LKILNIVNKN FIENMKKIFS
HCPYIKDPII RSYIHSDEDN KFDDFMRQHR FSEVNFDTQQ MIDFINRFNT NKWLIDKNNN
FFIQLIDQAL RSTDDMIKAN VWHLYKEWIR SDDVSPIFIE TEDNLRTFNT NELTRNDNIF
ILFSSVDDGP VMVVSSQRLH DMLNPTKDTN WNSTYIYKSR HEMLPVNLTQ ETLFSSKSHG
KYALFPIFTA SWRAHRIMNK GV