Gene EcSMS35_3542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3542 
SymbolcafA 
ID6147375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3622971 
End bp3624440 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content52% 
IMG OID641618371 
Productribonuclease G 
Protein accessionYP_001745518 
Protein GI170679965 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID[TIGR00757] ribonuclease, Rne/Rng family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.721219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTG AATTGTTAGT AAACGTAACG CCTTCGGAAA CGCGAGTGGC GTATATTGAT 
GGCGGTATTC TGCAGGAAAT TCATATTGAA CGTGAGGCGC GACGCGGAAT AGTAGGCAAT
ATCTACAAGG GTCGTGTAAG TCGTGTACTT CCGGGTATGC AGGCGGCTTT TGTAGATATT
GGGCTGGATA AAGCCGCGTT TCTTCATGCA TCCGACATCA TGCCGCACAC CGAATGTGTG
GCGGGTGAAG AACAAAAGCA ATTCACGGTG CGCGACATCT CGGAGCTGGT CCGTCAGGGG
CAAGATCTGA TGGTGCAGGT GGTGAAAGAT CCGCTTGGCA CTAAAGGTGC GCGCCTGACC
ACCGATATCA CGCTGCCTTC TCGCTATCTG GTGTTTATGC CAGGTGCTTC TCACGTTGGG
GTATCCCAAC GTATTGAAAG CGAATCAGAA CGTGAACGCC TGAAAAAAGT GGTCGCAGAG
TATTGTGACG AGCAGGGCGG GTTTATCATC CGTACTGCAG CGGAAGGGGT TGGCGAGGCT
GAACTGGCCT CCGATGCCGC TTATCTGAAA CGCGTCTGGA CCAAAGTTAT GGAGCGTAAA
AAGCGCCCAC AGACCCGTTA TCAGCTGTAC GGCGAACTGG CGCTGGCGCA GCGTGTTCTG
CGTGATTTCG CCGACGCCGA ACTCGACCGC ATTCGCGTTG ACTCACGCCT GACTTACGAA
GCATTGCTGG AGTTTACCTC GGAGTACATT CCCGAGATGA CCAGCAAGCT GGAGCATTAC
ACCGGACGCC AGCCGATTTT CGATCTCTTT GATGTCGAAA ACGAAATTCA GCGAGCGCTG
GAACGCAAAG TAGAACTGAA ATCCGGTGGC TATCTGATTA TCGACCAGAC CGAAGCGATG
ACCACCGTGG ACATCAATAC CGGGGCGTTT GTCGGTCATC GCAATCTGGA CGACACCATT
TTCAATACCA ATATTGAAGC GACGCAGGCT ATCGCTCGCC AGTTACGGTT GCGCAATCTG
GGCGGGATTA TCATTATTGA TTTCATCGAC ATGAATAATG AAGATCACCG TCGCCGCGTG
TTGCATTCGC TGGAGCAGGC GTTGAGCAAA GACCGGGTGA AAACCAGCGT TAATGGTTTT
TCGGCGCTGG GGCTGGTGGA GATGACGCGT AAACGCACCC GCGAAAGCAT TGAGCACGTA
CTGTGTAACG AATGCCCAAC CTGCCACGGT CGCGGAACGG TGAAAACCGT GGAAACGGTA
TGCTATGAAA TCATGCGCGA GATTGTTCGT GTCCACCATG CTTACGACTC CGACCGTTTC
CTGGTCTATG CTTCTCCGGC AGTAGCCGAA GCCTTGAAAG GCGAAGAGTC ACACTCGCTG
GCGGAAGTGG AAATTTTCGT TGGCAAACAG GTTAAAGTAC AAATTGAACC GCTCTATAAC
CAGGAGCAGT TTGACGTCGT AATGATGTAA
 
Protein sequence
MTAELLVNVT PSETRVAYID GGILQEIHIE REARRGIVGN IYKGRVSRVL PGMQAAFVDI 
GLDKAAFLHA SDIMPHTECV AGEEQKQFTV RDISELVRQG QDLMVQVVKD PLGTKGARLT
TDITLPSRYL VFMPGASHVG VSQRIESESE RERLKKVVAE YCDEQGGFII RTAAEGVGEA
ELASDAAYLK RVWTKVMERK KRPQTRYQLY GELALAQRVL RDFADAELDR IRVDSRLTYE
ALLEFTSEYI PEMTSKLEHY TGRQPIFDLF DVENEIQRAL ERKVELKSGG YLIIDQTEAM
TTVDINTGAF VGHRNLDDTI FNTNIEATQA IARQLRLRNL GGIIIIDFID MNNEDHRRRV
LHSLEQALSK DRVKTSVNGF SALGLVEMTR KRTRESIEHV LCNECPTCHG RGTVKTVETV
CYEIMREIVR VHHAYDSDRF LVYASPAVAE ALKGEESHSL AEVEIFVGKQ VKVQIEPLYN
QEQFDVVMM