Gene EcSMS35_3797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3797 
Symbol 
ID6143192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3862537 
End bp3863538 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content46% 
IMG OID641618623 
Productputative permease 
Protein accessionYP_001745763 
Protein GI170679968 
COG category[R] General function prediction only 
COG ID[COG0701] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.236031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGTT GGCTTGCGAT GCTGCAAGAT GCCGCAGAGA TGTTTGTGTT TCTCGCCGTC 
GAGCTTTCTT TGCTGTTTAT AGTGATTAGT GCCGGTGTCA GCCTGATAAG ACAAAAGGTG
CCAGACCATA AAATCCAGCA GATGATGGGG GCCAGAAAAG GGAGAGGTTA TCTCCTGGCT
GCTCTGTTGG GAGCCGTTAC CCCGTTCTGT AGTTGCTCGA CAATCCCCAT GTTACGTGGA
TTGTTATCAG CGAAAGCCGG GTTTGGTCCG ACCCTCACTT TTTTATTTGT TTCCCCATTA
CTTAATCCCA TTATCGTCGG GTTAATGTGG GTGACCTTTG GCTGGAAAGT TACCTTGTTG
TACGCGATTA TCGCCGCCGG CGTCTCCGTA CTTGCCAGTA TTATCCTGGA TTCCCTGGGA
TTTGAACGTC ATATCATTGC CAGTAAAAGC TCATCAGCAA ATTGTTGTGC TCCAGCCAAA
ACTTCGCCGG GGACGACATA TACGCCAATA GAAGTGAGTT GCTGTAGTCC AACGGCTAAA
GCCATTGAGA AACCCGTAGT TAACTGTTGC AATACCAAAG CTGTGGTAAG TATTAATCCC
ATAAAACTAG CCACCAAAGA TGCGTTGCAA CAGTTTAAAG ATGTACTGCC ATATCTTTTG
TTAGGGGTAT TAATAGGCTC TTTTATTTAT GGCTTTATTC CCTCAGAGTG GATTGCCGCT
CATGCAGGGG CAGATAATCC CTTCGCCATC CCATTGAGCG CCGTTGTTGG TATTCCGCTA
TATATCCGGG CAGAGGCAGT TATCCCTCTG GCATCTGTTT TGATGACAAA AGGAATGGGT
CTGGGAGCAT TAATGGCATT AATCATCGGC AGTGCCGGCG CAAGCCTGAC GGAAGTGATA
TTACTTAAAT CAATGTTCAG AATACCGATG ATAGTTGCAT TCCTGACGGT TATATTAGGT
ATGGCTATCT TGATGGGCTA TTTGACTCAA ATGCTATTTT AA
 
Protein sequence
MSSWLAMLQD AAEMFVFLAV ELSLLFIVIS AGVSLIRQKV PDHKIQQMMG ARKGRGYLLA 
ALLGAVTPFC SCSTIPMLRG LLSAKAGFGP TLTFLFVSPL LNPIIVGLMW VTFGWKVTLL
YAIIAAGVSV LASIILDSLG FERHIIASKS SSANCCAPAK TSPGTTYTPI EVSCCSPTAK
AIEKPVVNCC NTKAVVSINP IKLATKDALQ QFKDVLPYLL LGVLIGSFIY GFIPSEWIAA
HAGADNPFAI PLSAVVGIPL YIRAEAVIPL ASVLMTKGMG LGALMALIIG SAGASLTEVI
LLKSMFRIPM IVAFLTVILG MAILMGYLTQ MLF