Gene EcSMS35_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0016 
Symbol 
ID6142614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp18454 
End bp19698 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content41% 
IMG OID641614917 
Producthypothetical protein 
Protein accessionYP_001742133 
Protein GI170682715 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.969858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACCA ACGTAAATGT TGATTGTTGC AAAACACCAG GATGCAAAAA CCTGGGGTTG 
TTGAATAGCC AGGATTATAT TGCACAAGGA AAAAATATTT TATGTCGGGA ATGCGGTTAC
TTGTTTCCTG TGATATCTGA ACGATCTCTT AATATTTATC GTAATCGTGT TAATCATTCA
TGGAGAGGGT TGGTTCCACA ATGTTCAGCT TGCGGAAGCA CATCTCTGAA AAAGTATGGC
TATTCAGCTC AGGGGCAGAG GAGAATGTAT TGTCATCATT GTTCCAGAAC TTTTATCACT
CTGGATCATG TAAATACAAC GCCGCGAAGA ACACATTTAG CATTGATGAT TGATCAAGGC
GCTTCACTTG CAGATATCCG TAAATTATTA CTTCTTAATA GCACAGGGCT TAACCGTGAG
TTGATGAAAC TAGCCCGGGA AGTAAACTAT AAAGAAAGTT ACCAATACTC TTCTGCTTCT
GATATTTCTC TATCTACCCG CGCTTTTCGC GTAAAGTTTA ACGGTAGTAA TAACTATCTT
TATAGCCTTG TTACCGCAGA AGAACAAAGC GGTAGAGTCG TCGCTATTTC AACAAATTAT
TCTCCATCAG CGGTAGAGTC GCATTATCAA TATGCATCAA GTTATGAAGA GCGTTTGTCT
CCAGGTACAC TGGCGCATCA CGTACAACGC AAGGAGTTAC TTACTATGCG GCGTGATACC
TTATTTGATA TTGATTATGG TCGGGCGATA TTACATCAAA ACGATCCGGG GATGTTGGTG
AAACCGGTTC TTCCGGCGTA TCGCCATTTT GAACTGGTTA GAGCACTGAC TGAAGCATAT
TCTATAAATA TTCAACACTA CCTTGATCAC GAGTGCTTTA TATTAGGTGG TTGTCTGATG
GCTAATTTAC AGCATATTCA TCAAGGCCGG TGCCATATTT CATTTGTGAA AGAACGCGGC
GAGAAAACGA CTCATTATGA TACGCCGCCG CGGTTGTTTT TGAGTGGTGG CGTGAGAAAT
AATGTTTGGC GTACATTTTC CGCCCGTGAT TATTCAATGG CTGTTTGTAA CCTTACAGGA
AACAAGAAAG CAAATGAAAT GCGGTATTCG ACGTTAGCAA GCGCGACTCG TTTTATCAAC
TTCCTGGAGT CTCATCCCTT TCTATCCTCA TTAAACCGAA TGTCGCCTGC CAATGTTGTC
TCCACATTAG ATATTTTCAA GCATCTCTGG AATAAACAGC TATAG
 
Protein sequence
MFTNVNVDCC KTPGCKNLGL LNSQDYIAQG KNILCRECGY LFPVISERSL NIYRNRVNHS 
WRGLVPQCSA CGSTSLKKYG YSAQGQRRMY CHHCSRTFIT LDHVNTTPRR THLALMIDQG
ASLADIRKLL LLNSTGLNRE LMKLAREVNY KESYQYSSAS DISLSTRAFR VKFNGSNNYL
YSLVTAEEQS GRVVAISTNY SPSAVESHYQ YASSYEERLS PGTLAHHVQR KELLTMRRDT
LFDIDYGRAI LHQNDPGMLV KPVLPAYRHF ELVRALTEAY SINIQHYLDH ECFILGGCLM
ANLQHIHQGR CHISFVKERG EKTTHYDTPP RLFLSGGVRN NVWRTFSARD YSMAVCNLTG
NKKANEMRYS TLASATRFIN FLESHPFLSS LNRMSPANVV STLDIFKHLW NKQL