Gene EcSMS35_2426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2426 
Symbol 
ID6147313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2473374 
End bp2475140 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content47% 
IMG OID641617298 
Productvon Willebrand factor type A domain-containing protein 
Protein accessionYP_001744470 
Protein GI170681089 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAATA AAAATATAAT CATGTTGCTT ATGAGTAGTT TGATTTTGTC AGGATGTGGG 
CCGGAATCTG AGAATAAGGA AAGTCTGCAA CAACAACCCA GTACTCCCAC GGACCAGCAA
GTGCTTGCCG CGCAACATGC TGCAATAAAA GAGGCCGAGC AAAGATCCGT CGCCGCTAAA
GCCACTGCAG ACGCGAAAGC AAAAGCCTTA GCCCAGCAAG AAGCGCAACA ATATTCAGAC
AAACAGGCTT TGCAGGGGCG CTTACAGGCG GCTCCAAAAT ATCAACATGC AGCCAGAGAA
AAAGCAGCCT CCCAAATCGC AAATCCAGGA ACCGCTCGCT ATCAGCAGTT CGATGATAAT
CCGGTTAAGC AGGTAGCGCA AAACCCGTTG GCGACATTTA GCCTTGACGT CGACACCGGC
AGTTATGCGA ATGTACGCCG TTTTCTCAAT CATGGGCAGT TGCCTCCGCC AGATGCAGTG
CGGGTGGAGG AGATGGTGAA TTATTTTCCG TCAGACTGGG TTATTAATGA TAAAAGTAAT
AATAAAGAAC CTGTTCCCGC CAGTAAGCCA ATACCTTTCG CTATGCGCTA CGAACTGGCA
CCTGCGCCGT GGAATGAACA GCGAACATTG CTGAAAGTTG ATATCCTGGC GAAAGATCGC
AAAAGTGAAG AGTTACCAGC TTCTAATCTG GTCTTTCTTA TCGACACTTC TGGTTCAATG
ATTTCTGATG AACGTTTGCC ACTTATCCAG TCTTCGTTGA AATTATTGGT CAAAGAACTT
CGTGAGCAGG ATAACATTGC CATCGTGACC TATGCTGGCG ACTCCCGTAT TGCGTTGCCT
TCTATCTCCG GGAGTCACAA GGCGGAAATT AATGCCGCAA TTGATTCGCT GGATGCCGAT
GGCAGTACCA ATGGCGGTGC CGGGCTGGAA CTGGCTTATC AGCAGGCGGC GAAGGGGTTT
ATTAAGGGTG GCATCAATCG CATTTTATTG GCCACTGATG GTGATTTTAA CGTCGGCATT
GACGATCCTA AATCGATTGA ATCAATGGTC AAAAAACAGC GGGAGTCCGG TGTTTCTCTG
TCCACGTTTG GCGTGGGGGA TAGCAATTAC AACGAGGCAA TGATGGTGCG AATCGCCGAT
GTTGGCAACG GTAACTACAG CTACATTGAT ACCCTCGCTG AAGCACAGAA AGTATTGAAC
AGTGAAATGC GGCAGACGTT GATTAGCGTA GCAAAAGATG TCAAAGCGCA AATTGAGTTT
AACCCCGCGT GGGTAACGGA ATACCGTCAG ATTGGTTATG AAAAGCGTCA ACTTCGGGCG
GAAGATTTTA ATAACGACAA CGTTGATGCG GGTGATATTG GCGCAGGTAA ACATATAACG
TTGTTATTCG AATTAACGCT GAAAGGGCAA AAAGCATCAA TTGATAAGTT ACGCTATGCC
CCGGATAACA AATCAGCGAA ATCGGACAAA ACAAAAGAAC TTGCCTGGTT AAAGATTCGT
TGGAAATACC CGCAGGGAAA AGAAAGTCAG TTAGTTGAAT TCCCGCTGGG GCCAACAATA
AACGCGCCCT CTGAAGATAT GCGTTTTCGC GCAGCAGTAG CTGCATATGG GCAAAAGTTA
CGCGGTTCTG AATACCTGGA TGAAATCTCA TGGCAGCAGA TAAAACAGTG GGCTCAGCAG
GCAAAAGGGG AAGATCCACA GGGTTACAGG GCGGAATTTA TTCGCCTGGT TGAACTGGCG
GGTGGTGTGA CTGACATCAG TCAGTGA
 
Protein sequence
MRNKNIIMLL MSSLILSGCG PESENKESLQ QQPSTPTDQQ VLAAQHAAIK EAEQRSVAAK 
ATADAKAKAL AQQEAQQYSD KQALQGRLQA APKYQHAARE KAASQIANPG TARYQQFDDN
PVKQVAQNPL ATFSLDVDTG SYANVRRFLN HGQLPPPDAV RVEEMVNYFP SDWVINDKSN
NKEPVPASKP IPFAMRYELA PAPWNEQRTL LKVDILAKDR KSEELPASNL VFLIDTSGSM
ISDERLPLIQ SSLKLLVKEL REQDNIAIVT YAGDSRIALP SISGSHKAEI NAAIDSLDAD
GSTNGGAGLE LAYQQAAKGF IKGGINRILL ATDGDFNVGI DDPKSIESMV KKQRESGVSL
STFGVGDSNY NEAMMVRIAD VGNGNYSYID TLAEAQKVLN SEMRQTLISV AKDVKAQIEF
NPAWVTEYRQ IGYEKRQLRA EDFNNDNVDA GDIGAGKHIT LLFELTLKGQ KASIDKLRYA
PDNKSAKSDK TKELAWLKIR WKYPQGKESQ LVEFPLGPTI NAPSEDMRFR AAVAAYGQKL
RGSEYLDEIS WQQIKQWAQQ AKGEDPQGYR AEFIRLVELA GGVTDISQ