Gene EcHS_A3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3961 
SymbolyieM 
ID5591035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3955331 
End bp3956782 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID640923068 
Producthypothetical protein 
Protein accessionYP_001460545 
Protein GI157163227 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA 
GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCAG TCTTCTTTGA AAAATTCCCA
CGCCTGAAGG CAGCAATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG
CTGAAAGATG CCCGAGTCCC GCCAGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC
CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT
CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCGCGATC
ACTTCAGCGT TACACACACT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA
ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTAAG TGAAGTTCAG
GAACGCATGA CGCTGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT
GGTCGTCTGT GGGATATGAG CGCCGGCCAG CTTAAACGTG GCGACTATCA GTTGATTGTG
AAATACGGTG AATTTCTTAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA GCAGCTGGGG
CGTTCTCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC
ATGGTGCGCG AACCGGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT
ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACGGAACT GGAGTATGAG
TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG
CGTGAAAAAG TGATCGAACG TCCGGTGGTA CATAAAGATT ACGATGAACA GCCGCGCGGG
CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG
AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCAG AAAACCGGCG CTGCTATATT
ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAA
GCAATCCGTT TTTTAAGCCA GCAGTTTCGT GGCGGCACCG ATCTTGCCAG TTGTTTTCGC
GCCATTATGG AACGCTTGCA AAGCAGGGAA TGGTTTGATG CCGATGCGGT GGTGATTTCT
GATTTTATCG CTCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GCTGCAGCGG
GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ACGGCAAACC CGGCATCATG
CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC
TGGCGGCGAT AA
 
Protein sequence
MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR 
LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSAI
TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTLSGQLE PILADNNTAA
GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT
MVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW
REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI
MLFSTEIVRY ELSGPQGIEQ AIRFLSQQFR GGTDLASCFR AIMERLQSRE WFDADAVVIS
DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR
WRR