Gene EcSMS35_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0289 
Symbol 
ID6146989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp296996 
End bp298135 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content52% 
IMG OID641615186 
Producthypothetical protein 
Protein accessionYP_001742395 
Protein GI170682374 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03073] release factor H-coupled RctB family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.15887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAAT ATATTCGTCC CTTATCTGAT GCGGTACTTA CCATCGCATC TGATGACCTG 
TGGATCGAGA GTTCAGCGAT CCAACAATTA CACACCACGG CAAATTTACC CGACATGCAA
CGCGTAGTAG GGATGCCAGA TTTACACCCC GGACGCGGCT ATCCGATTGG CGCAGCGTTC
TTCTCCGTAG GCCGTTTTTA CCCGGCACTG GTCGGCAATG ATATCGGCTG CGGTATGGCG
CTATGGCAAA CAGATATTTT CGCGCGCAAA TACAACGCCG ATAAGTTTGA AAAGCGATTA
TCTGCGCTGG ATGACGTTGC TGAAGAAAGC TGGCTGGAGG AAAACCTGCC GTCAGCGTTA
GCACAGCATC CGTGGCGCAG CTCGCTGGGT TCCATCGGTG GCGGAAACCA CTTCGCAGAA
CTGCAACAAG TTGATGAAAT TATCGACGCT GAACTGTTTG CACTGACAGG TCTGGATGCG
CAGCATCTGC AACTGCTGGT TCATAGCGGC TCGCGGGGTT TGGGCCAGTC TATTTTACAG
CGGCATATTG CCTCGTTTTC GCATCATGGT TTGCCTGAAG GCAGTGACGA CGCGCTAAGT
TATATTGTGG AACATGATGA TGCGCTGGCG TTTGCGCGTA TTAATCGCCA GCTGATCGCT
TTGCGCATAA TGCAACAGAT TAAGGCCAAC GGTAATTCGG TTCTGGATGT GGCGCATAAC
TTTGTTAGCG CGTGTCGAAT CGGTGATCAA CAGGGCTGGT TGCATCGTAA AGGTGCCACA
CCGGATGACA ACGGTCTGGT GATTATTCCC GGTTCACGCG GTGATTACTC CTGGCTGGTT
CAGCCCGTCA GGAGTGAGGA AACATTGCAT TCGCTGGCGC ATGGGGCTGG GCGTAAATGG
GGGCGCACCG AGTGTAAAGG GCGTCTGGCA GCGAAATACA CAGCGACGCA GCTCTCACGT
ACTGAACTTG GCAGCCGGGT AATTTGTCGC GATAAACAAC TCATCTTTGA AGAAGCGCCA
CAAGCTTATA AATCGGCTGA AAGCGTGGTG CAATGTCTGG TGCAGGCTGG GTTAATTATT
CCTGTCGCGC GACTGCGTCC GGTGCTAACG CTCAAAAACA GTGGAGGGAA AAAAGGATGA
 
Protein sequence
MGKYIRPLSD AVLTIASDDL WIESSAIQQL HTTANLPDMQ RVVGMPDLHP GRGYPIGAAF 
FSVGRFYPAL VGNDIGCGMA LWQTDIFARK YNADKFEKRL SALDDVAEES WLEENLPSAL
AQHPWRSSLG SIGGGNHFAE LQQVDEIIDA ELFALTGLDA QHLQLLVHSG SRGLGQSILQ
RHIASFSHHG LPEGSDDALS YIVEHDDALA FARINRQLIA LRIMQQIKAN GNSVLDVAHN
FVSACRIGDQ QGWLHRKGAT PDDNGLVIIP GSRGDYSWLV QPVRSEETLH SLAHGAGRKW
GRTECKGRLA AKYTATQLSR TELGSRVICR DKQLIFEEAP QAYKSAESVV QCLVQAGLII
PVARLRPVLT LKNSGGKKG