Gene EcSMS35_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4209 
Symbol 
ID6143080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4306913 
End bp4308403 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content51% 
IMG OID641619032 
Productintegral membrane protein 
Protein accessionYP_001746160 
Protein GI170682802 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000736829 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTTGAAA GTGAACTTTT AACTCAGGGT TTTTCGACAT TACTCAATAA TCCTCAGGCG 
CTGCTGTTTG CCACTTTTGG GGTGATGCTG GGTATTGTGA TTGGCGCTTT GCCTGGTCTG
ACAGCGACCA TGGGTGTGGC GATTTTGCTG CCTTTCACCT ACGGCATGGA GCCTGTTTCT
GGCTTGTTGA TGATTTGCGG CGTCTTTTTT GGTGGCGTCT ACGGTGGTTC TATCACCGCA
ATTTTGCTCA AAATTCCCGG AACGCCTGCC GCCGCAGCCA CCGCTATTGA TGGTTATGAG
CTGACGAAAC AGGGAAAAGC AGGGCTGGCA TTGAGTGCCG CCACGTTCTC TTCCTTTAGT
GGCGGAACGC TCAGCATTAT CGTGCTGATG TTTCTCTCTC CGGTACTTGC CAGTTGGGCG
CTGAAATTTA GTGCCTCGGA GTCCTTCGCC CTGGCAACCT TCGGACTGAG CATTATTGCC
AGTATTTCCG GTGAGTCACT GATTAAAGGG CTGATTGCCG GGGTTGGCGG ATTGCTGATC
GCAACGATAG GCCTTGATCC AATGGGCGGT TTTCCACGGT TTACAGGTGG ATTTGTGGAG
CTGATGAATG TGCCATTTAT CCCGGTGATG ATTGGTTTAT TTGCTGCTTC AGAAGCATTC
CGCTCTATGG AGCAAAACCA GCAAATTCGT CAGGGGGCGA AGGTGGCTAT CGGCAGTCTG
TTACTGCCCT GGCAAACACT ACGCCGCATT GCATTAACCA TTTTGCGCTC ATCAGGATTA
GGGGTTTTTA TCGGCATGAT CCCCGGTGCG GGGGCGGATA TTGCAGCTTT TGTTGCCTAT
AACGAAACTC GTCGTTTCAG TAAGACACCA GAAAACTTTG GTAAGGGTGA AATTAAAGCT
GTGGCCTCCT GTGAGGCAGG CGCGAATGGC TGCACCGGTG GCGCATTGTT ACCTATGCTG
ACGCTGGGGA TCCCTGGCGA TGCGGTAACC GCCATCATGC TGGGCGCGTT AACGTTACAG
GGAATGCAAC CAGGCCCGCT GATGTTTACC GACCACGGCG ATATGGTTTA TACACTGTTT
GTCGGCATGA TCTTCTGTTA CTTCATGCTA CTAGTTCTTG GACTGCTCTC TTTGAAAGTC
ATCGGTAATG TGGTGAAAAT TCCCGGCAAT ATTCTCACAC CGATGATCCT CGCACTTTGT
GTGGTCGGGA CTTATGCGTT GAACAATAGC CTGTTTGATG TTGGCATTAT GCTGATTGCA
GGCGTGGTGG GCTATTTCAT GCAGAAAGGA GGATATCCGG CATCACCCGT AGTGCTGGCA
TTGATTATGG GGCCAATGGC GGAAAGTAAT TTTCGCCGTG CGCTGTCGCT TTCTGGTGGG
TCACTCGACT TTCTGTATAC CCGACCGATA ACTCTGGCAT TGCTGACTCT GGCAGCCTTT
ACGCTACTGA CGCCAATAAT CCGCAAAATA ATGCGTTTAC GGCGTCAATA A
 
Protein sequence
MFESELLTQG FSTLLNNPQA LLFATFGVML GIVIGALPGL TATMGVAILL PFTYGMEPVS 
GLLMICGVFF GGVYGGSITA ILLKIPGTPA AAATAIDGYE LTKQGKAGLA LSAATFSSFS
GGTLSIIVLM FLSPVLASWA LKFSASESFA LATFGLSIIA SISGESLIKG LIAGVGGLLI
ATIGLDPMGG FPRFTGGFVE LMNVPFIPVM IGLFAASEAF RSMEQNQQIR QGAKVAIGSL
LLPWQTLRRI ALTILRSSGL GVFIGMIPGA GADIAAFVAY NETRRFSKTP ENFGKGEIKA
VASCEAGANG CTGGALLPML TLGIPGDAVT AIMLGALTLQ GMQPGPLMFT DHGDMVYTLF
VGMIFCYFML LVLGLLSLKV IGNVVKIPGN ILTPMILALC VVGTYALNNS LFDVGIMLIA
GVVGYFMQKG GYPASPVVLA LIMGPMAESN FRRALSLSGG SLDFLYTRPI TLALLTLAAF
TLLTPIIRKI MRLRRQ