Gene EcSMS35_4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4387 
Symbol 
ID6144833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4475606 
End bp4476733 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content48% 
IMG OID641619208 
Producthypothetical protein 
Protein accessionYP_001746332 
Protein GI170683638 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAATC TGTTTGCGAT TAATGTCTGT AAAACGTTCG GTTGTCGAAA TTTGGGATTG 
GCCTCATCGG AAGATTACAG TTGGCCAGAC TATAAACTGG GTTTTCCGGC ACTGCACTGC
CAGGCATGCG GAAGTTACCC CCCTTTGTTT GATGAACAAC AATTTCGTGA CTGGTTGTCA
GTTCACATGA CTGCGTGTGC CATAGAAACA GGGCATTTTT GTCCATCATG CTATTGTAAA
GAAAAGATTT TTTATGGCCA CAACCCGCAG GGAACACAAC GCGTTCAATG CCGTTCATGC
AAAAAAGTCT GGACTCCGAA ACAGCAGCCA GCAAGAGAAA TTACGTATCC CCAGTCCATT
GAAACCGTTC AACTGTTCAT GCCCTTTCAA GGAGCCAGCG CAGTACAAAA GCTCTATGTT
TTAGTCAGTC TTGACGCCAC TCGTGGCAAC ATCCTTCATC TCTCCACGAA TTATACGCAA
CATCAGACAG GAGACAGTCT GCGGTACAGT TACAGAGGCA ATACAGAACC AACTGAGCAC
CATAGCGATA TTGTGCAGAG GGTAGATATG CGTGAAGCGC AATTCTTGCG CCGGAGCCAG
TTCGATGAAA TTCAGTATGG CAGTGCTGTG CTCAAGCGCA ACGGCAAGGG AGCCATACTA
CGCCCGGTTA TCACGGCACA CGGGCATTTC AGAATACTGA AAATCCGCTT TCCACATGTC
AAAACGCATA TCATTTCACA CGAATGTTTT CTGAGAGGCG CAATTATTAC AGCCTGGGCA
GATCAGTTCC GCCAACAACA AGGCGAACTT TGGTTCGTAG AAGAAGAAAT CAGCGACCGT
AACGCTGATA TTCCCTGGCA TTTTCAGGGA ACGACATACC ATGGTTGGTG GCAAAATCAG
TGGCAACGCT GGGGGCAGGG GAATAACAGC AAGATGGTCT GCCTGCTCAC AGGAGTCTCC
TTAGAAAGGG GCGCAAATGT TTCTCTGGCA ACCAGTCGTT GCTTTATCAC ATGGCTGACA
GACCAACACG ACTTTACCCA AAGCGCGTTA TTATCCGCAG GTCGCGTAAC GAAAATGCTG
ACCTCACTGG CGTTAAAATA CAATGAATCG CTCACTCCAT CTTGTTAG
 
Protein sequence
MSNLFAINVC KTFGCRNLGL ASSEDYSWPD YKLGFPALHC QACGSYPPLF DEQQFRDWLS 
VHMTACAIET GHFCPSCYCK EKIFYGHNPQ GTQRVQCRSC KKVWTPKQQP AREITYPQSI
ETVQLFMPFQ GASAVQKLYV LVSLDATRGN ILHLSTNYTQ HQTGDSLRYS YRGNTEPTEH
HSDIVQRVDM REAQFLRRSQ FDEIQYGSAV LKRNGKGAIL RPVITAHGHF RILKIRFPHV
KTHIISHECF LRGAIITAWA DQFRQQQGEL WFVEEEISDR NADIPWHFQG TTYHGWWQNQ
WQRWGQGNNS KMVCLLTGVS LERGANVSLA TSRCFITWLT DQHDFTQSAL LSAGRVTKML
TSLALKYNES LTPSC