Gene EcSMS35_3700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3700 
SymbolrtcA 
ID6147489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3763365 
End bp3764381 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content60% 
IMG OID641618527 
ProductRNA 3'-terminal-phosphate cyclase 
Protein accessionYP_001745667 
Protein GI170680706 
COG category[A] RNA processing and modification 
COG ID[COG0430] RNA 3'-terminal phosphate cyclase 
TIGRFAM ID[TIGR03399] RNA 3'-phosphate cyclase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0886189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGA TGATTGCGCT GGATGGCGCA CAGGGCGAAG GCGGCGGGCA GATCCTGCGC 
TCGGCGCTGA GCCTGTCGAT GATAACCGGC CTGCCATTTA CCATCACCGG CATTCGTGCC
GGGCGGGCAA AACCGGGACT GTTGCGCCAG CATCTGACCG CGGTAAAAGC GGCTGCGGAA
ATTTGTAGGG CAACGGTGGA AGGTGCGGAG CTGGGATCGC AGCGTCTGCT CTTCCGGCCC
GGCACCGTGC GCGGCGGCGA TTACCGCTTT GCTATCGGTA GCGCCGGAAG TTGTACGCTG
GTGCTGCAAA CGGTGCTGCC CGCGCTGTGG TTTGCCGATG GACCTTCGCG TGTTGAAGTG
AGCGGAGGCA CCGATAACCC GTCGGCCCCG CCTGCGGATT TTATCCGCCG GGTGCTGGAG
CCGCTGCTGG CGAAAATGGG CATTCATCAG CAAACCACAC TAATACGCCA CGGTTTTTAT
CCTGCCGGAG GCGGGGTGGT GGCAACGGAA GTCTCGCCCG TGGCATTGTT TAACACCTTG
CAACTTGGCG AGCGCGGGAA CATTGTGCAG ATGCGTGGAG AGGTGCTATT AGCTGGCGTG
CCGAGGCATG TTGCTGAGCG TGAAATCGCT ACGCTGGTGG GGAGTTTTTC CCTGCATGAG
CAGAATATTC ATAACCTGCC GCGTGACCAG GGGCCGGGTA ATACCGTCTC GCTTGAAGTC
GAAAGTGAAA ATATCACCGA ACGCTTTTTT GTCGTCGGTG AAAAGCGCGT CAGCGCTGAG
GTGGTCGCGG CACAGTTGGT GAAAGAGGTG AAACGCTACC TGGCAAGCCC GGCGGCGGTG
GGGGAATATC TCGCCGACCA ACTGGTGCTA CCGATGGCGC TGGCGGGCGC GGGAGAATTT
ACGGTCGCCC ATCCCTCATG CCATCTGCTG ACCAATATTG CGGTGGTGGA GCGTTTCTTG
CCAGTGCGGT TTGGTCTGGT GGAGGCTGAT GGCGTAACGC GGGTGAGCAT TGAATGA
 
Protein sequence
MKRMIALDGA QGEGGGQILR SALSLSMITG LPFTITGIRA GRAKPGLLRQ HLTAVKAAAE 
ICRATVEGAE LGSQRLLFRP GTVRGGDYRF AIGSAGSCTL VLQTVLPALW FADGPSRVEV
SGGTDNPSAP PADFIRRVLE PLLAKMGIHQ QTTLIRHGFY PAGGGVVATE VSPVALFNTL
QLGERGNIVQ MRGEVLLAGV PRHVAEREIA TLVGSFSLHE QNIHNLPRDQ GPGNTVSLEV
ESENITERFF VVGEKRVSAE VVAAQLVKEV KRYLASPAAV GEYLADQLVL PMALAGAGEF
TVAHPSCHLL TNIAVVERFL PVRFGLVEAD GVTRVSIE