Gene B21_03573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03573 
SymbolyieM 
ID8113563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3815030 
End bp3816481 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID644849741 
Producthypothetical protein 
Protein accessionYP_003001314 
Protein GI251787010 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.585554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA 
GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCAG TCTTCTTTGA AAAATTCCCA
CGCCTGAAGG CAGCAATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG
CTGAAAGATG CCCGAGTCCC GCCAGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC
CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT
CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCGCGATC
ACTTCAGCGT TACACACACT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA
ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTAAG TGAAGTTCAG
GAACGCATGA CGCTGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT
GGTCGTCTGT GGGATATGAG CGCCGGCCAG CTTAAACGTG GCGACTATCA GTTGATTGTG
AAATACGGTG AATTTCTTAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA GCAGCTGGGG
CGTTCTCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC
ATGGTGCGCG AACCGGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT
ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACGGAACT GGAGTATGAG
TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG
CGTGAAAAAG TGATCGAACG TCCGGTGGTA CATAAAGATT ACGATGAACA GCCGCGCGGG
CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG
AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCTG AAAACCGGCG CTGCTATATT
ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAA
GCAATCCGTT TTTTAAGCCA GCAGTTTCGT GGCGGCACCG ATCTTGCCAG TTGTTTTCGC
GCCATTATGG AACGCTTGCA AAGCAGGGAA TGGTTTGATG CCGATGCGGT GGTGATTTCT
GATTTTATCG CTCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GCTGCAGCGG
GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ACGGCAAACC CGGCATCATG
CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC
TGGCGGCGAT AA
 
Protein sequence
MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR 
LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSAI
TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTLSGQLE PILADNNTAA
GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT
MVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW
REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI
MLFSTEIVRY ELSGPQGIEQ AIRFLSQQFR GGTDLASCFR AIMERLQSRE WFDADAVVIS
DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR
WRR