Gene EcSMS35_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3024 
SymbolprfB 
ID6146906 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3113984 
End bp3115082 
Gene Length1099 bp 
Protein Length365 aa 
Translation table11 
GC content53% 
IMG OID641617893 
Productpeptide chain release factor 2 
Protein accessionYP_001745044 
Protein GI170682178 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1186] Protein chain release factor B 
TIGRFAM ID[TIGR00020] peptide chain release factor 2 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAAA TTAATCCGGT AAATAATCGC ATTCAGGACC TCACGGAACG CTCCGACGTT 
CTTAGGGGGT ATCTTTGACT ATGACGCCAA GAAAGAGCGT CTGGAAGAAG TAAACGCCGA
GCTGGAACAG CCGGATGTCT GGAACGAACC CGAACGCGCA CAGGCGCTGG GTAAAGAGCG
TTCCTCCCTC GAAGCCGTTG TCGACACCCT TGACCAAATG AAACAGGGGC TGGAAGATGT
TTCTGGTCTG CTGGAACTGG CTGTAGAAGC TGACGACGAA GAAACCTTTA ACGAAGCCGT
TGCTGAACTC GACGCTCTGG AAGAAAAACT GGCGCAGCTT GAGTTCCGCC GTATGTTCTC
TGGCGAATAT GACAGCGCCG ACTGCTACCT CGATATTCAG GCGGGGTCTG GTGGTACGGA
AGCACAGGAC TGGGCGAGCA TGCTTGAGCG TATGTATCTG CGCTGGGCAG AATCGCGTGG
TTTCAAAACT GAAATCATCG AAGAGTCGGA AGGTGAAGTG GCGGGTATTA AATCCGTGAC
GATCAAAATC TCCGGCGATT ACGCTTACGG CTGGCTGCGT ACAGAAACTG GCGTTCACCG
CCTGGTGCGT AAGAGCCCGT TTGACTCCGG CGGTCGTCGC CACACGTCGT TCAGCTCCGC
GTTTGTTTAC CCGGAAGTTG ATGACGATAT TGATATCGAA ATCAACCCGG CGGATCTGCG
CATCGACGTT TATCGCGCGT CCGGCGCGGG CGGTCAGCAC GTTAACCGTA CCGAATCTGC
GGTGCGTATT ACCCACATCC CGACCGGGAT CGTGACCCAG TGCCAGAACG ACCGTTCCCA
GCACAAGAAC AAAGACCAGG CCATGAAGCA GATGAAAGCG AAGCTTTATG AACTGGAGAT
GCAGAAGAAA AATGCCGAGA AACAGGCGAT GGAAGATAAT AAATCTGATA TCGGCTGGGG
CAGCCAGATT CGTTCTTATG TCCTTGATGA CTCCCGCATT AAAGATCTGC GTACCGGGGT
AGAAACCCGC AACACGCAGG CCGTGCTGGA CGGCAGCCTG GATCAATTTA TCGAAGCAAG
TTTGAAAGCA GGGTTATGA
 
Protein sequence
MFEINPVNNR IQDLTERSDV LRGYLDYDAK KERLEEVNAE LEQPDVWNEP ERAQALGKER 
SSLEAVVDTL DQMKQGLEDV SGLLELAVEA DDEETFNEAV AELDALEEKL AQLEFRRMFS
GEYDSADCYL DIQAGSGGTE AQDWASMLER MYLRWAESRG FKTEIIEESE GEVAGIKSVT
IKISGDYAYG WLRTETGVHR LVRKSPFDSG GRRHTSFSSA FVYPEVDDDI DIEINPADLR
IDVYRASGAG GQHVNRTESA VRITHIPTGI VTQCQNDRSQ HKNKDQAMKQ MKAKLYELEM
QKKNAEKQAM EDNKSDIGWG SQIRSYVLDD SRIKDLRTGV ETRNTQAVLD GSLDQFIEAS
LKAGL