Gene EcSMS35_3990 details


Gene Information

Locus tag: EcSMS35_3990
Symbol:
ID: 6146307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4068346
End bp: 4070055
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 54%
IMG OID: 641618816
Product: AsmA family protein
Protein accession: YP_001745955
Protein GI: 170683972
COG category:
COG ID:
TIGRFAM ID:
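
The coordinates and lengths listed above can be checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record states neither assumption explicitly. The function names are illustrative only.

```python
# Values copied from the gene record above.
START_BP = 4068346
END_BP = 4070055


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Encoded residues, assuming one trailing stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1710 569
```

Under these assumptions the computed values agree with the listed gene length (1710 bp) and protein length (569 aa).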


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTA TTGGGAAGCT GCTTCTCTAC ATTCTCATCG CTCTGTTAGT GGTGATCGCT 
GGCCTCTATT TTCTGCTGCA AACTCGCTGG GGAGCAGAAC ATATCAGCGC ATGGGTTTCC
GAGAATAGCG ACTATCATCT GGCCTTCGGG GCGATGGATC ACCGTTTTTC CGCGCCATCT
CATATCGTGC TGGAGAACGT CACGTTTGGC CGTGATGGCC AGCCCGCGAC CCTGGTGGCA
AAAAGTGTCG ACATTGCGCT AAGCAGTCGG CAACTGACCG AACCACGCCA TGTCGATACC
ATCCTGCTGG AAAACGGGAC GCTGAATCTC ACCGACCAGA CCGCGCCGCT ACCGTTCAAA
GCCGATCGTC TGCAACTACG TGATATGGCG TTTAATAGCC CGAATAGCGA ATGGAAACTG
AGCGCGCAGC GGGTAAATGG CGGCGTGGTT CCTTGGTCAC CAGAAGCCGG TAAAGTGCTG
GGTACGAAGG CGCAGATTCA GTTTAGTGCC GGATCGCTTT CGCTCAATGA TGTTCCTGCC
ACCAATGTAC TGATTGAAGG CAGTATTGAT AACGATCGCG TTACGCTGAC TAACCTGGGT
GCCGACATCG CCCGCGGGAC ATTAACCGGA AACGCGCAGC GTAACGCTGA CGGCAGCTGG
CAAGTGGAAA ACCTGCGCAT GGCGGATATC CGTCTACAAA GCGAAAAATC GCTAACCGAC
TTCTTTGCGC CATTACGCTC TGTCCCGTCG TTGCAGATTG GTCGCCTGGA AGTCATCGAC
GCTCGTTTGC AAGGTCCGGA CTGGGCGGTG ACCGACCTCG ATCTCAGCTT GCGCAACATG
ACCTTCAGTA AAGATGACTG GCAGACACAG GAAGGTAAAC TGTCGATGAA CGCCAGCGAG
TTTATTTATG GCTCGCTGCA TTTATTTGAT CCGATTATAA ACGCGGAATT TTCCCCGCAG
GGCGTAGCAC TGCGCCAGTT CACCAGCCGC TGGGAAGGGG GCATGGTCAG AACGTCAGGG
AACTGGTTGC GTGACGGGAA AATGTTGATC CTCGATGATG CGGCAATTGC CGGACTGGAA
TATACCTTGC CAAAAAACTG GCAACAGTTG TGGATGGAAA CGACACCCGG TTGGTTAAAC
AGCCTGCAAC TGAAGAGATT TAGCGCCAGC CGCAATCTGA TCATTGATAT CGACCCTGAC
TTCCCGTGGC AGCTCACCGC GCTCGATGGT TACGGTGCCA ACCTGACGCT GGTTACCGAT
CATAAATGGG GCGTCTGGAG TGGTTCGGCG AATCTGAATG CCGCCGCCGC GACATTCAAT
CGTGTTGATG TTCGTCGCCC ATCGCTGGCG CTGACCGCCA ACAGCAGCAC GGTGAATATC
AGCGAACTGA GTGCATTTAC TGAAAAAGGC ATTCTGGAAG CCACCGCCAG TGTTTCACAA
ACGCCACAAC GTCAGACCCA TATCAGCCTG AATGGACGCG GTGTGCCGGT GAATATTTTG
CAACAGTGGG GATGGCCTGA ATTACCGTTG ACTGGCGACG GCAATATTCA GCTTACCGCC
AGCGGCAATA TTCAGGCCAA TATCCCGCTG AAACCTACGG TTAGCGGGCA ATTGCATGCC
GTGAATGCCG CAAAGCAGCA AGTGACTCAA ACCATGAATG CGGGCGTAGT TTCCAGTGGT
GAAGTTACGT CGACGGAGTC GGTGCAGTAA
 
Protein sequence
MKFIGKLLLY ILIALLVVIA GLYFLLQTRW GAEHISAWVS ENSDYHLAFG AMDHRFSAPS 
HIVLENVTFG RDGQPATLVA KSVDIALSSR QLTEPRHVDT ILLENGTLNL TDQTAPLPFK
ADRLQLRDMA FNSPNSEWKL SAQRVNGGVV PWSPEAGKVL GTKAQIQFSA GSLSLNDVPA
TNVLIEGSID NDRVTLTNLG ADIARGTLTG NAQRNADGSW QVENLRMADI RLQSEKSLTD
FFAPLRSVPS LQIGRLEVID ARLQGPDWAV TDLDLSLRNM TFSKDDWQTQ EGKLSMNASE
FIYGSLHLFD PIINAEFSPQ GVALRQFTSR WEGGMVRTSG NWLRDGKMLI LDDAAIAGLE
YTLPKNWQQL WMETTPGWLN SLQLKRFSAS RNLIIDIDPD FPWQLTALDG YGANLTLVTD
HKWGVWSGSA NLNAAAATFN RVDVRRPSLA LTANSSTVNI SELSAFTEKG ILEATASVSQ
TPQRQTHISL NGRGVPVNIL QQWGWPELPL TGDGNIQLTA SGNIQANIPL KPTVSGQLHA
VNAAKQQVTQ TMNAGVVSSG EVTSTESVQ
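
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and that the spaces in the listing are only visual grouping. The helper names `clean`, `translate`, and `gc_fraction` are illustrative and not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence above.
first_blocks = "ATGAAATTTA TTGGGAAGCT"
print(translate(first_blocks))  # MKFIGK
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.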