Gene EcHS_A3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3050 
SymbolprfB 
ID5592155 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3062697 
End bp3063795 
Gene Length1099 bp 
Protein Length365 aa 
Translation table11 
GC content53% 
IMG OID640922167 
Productpeptide chain release factor 2 
Protein accessionYP_001459669 
Protein GI157162351 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1186] Protein chain release factor B 
TIGRFAM ID[TIGR00020] peptide chain release factor 2 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAAA TTAATCCGGT AAATAATCGC ATTCAGGACC TCACGGAACG CTCCGACGTT 
CTTAGGGGGT ATCTTTGACT ACGACGCCAA GAAAGAGCGT CTGGAAGAAG TAAACGCCGA
GCTGGAACAG CCGGATGTCT GGAACGAACC CGAACGCGCA CAGGCGCTGG GTAAAGAGCG
TTCCTCCCTC GAAGCCGTTG TCGACACCCT CGACCAAATG AAACAGGGGC TGGAAGATGT
TTCTGGTCTG CTGGAACTGG CTGTAGAAGC TGACGACGAA GAAACCTTTA ACGAAGCCGT
TGCTGAACTC GACGCCCTGG AAGAAAAACT GGCGCAGCTT GAGTTCCGCC GTATGTTCTC
TGGCGAATAT GACAGCGCCG ACTGCTACCT CGATATTCAG GCGGGGTCTG GCGGTACGGA
AGCACAGGAC TGGGCGAGCA TGCTTGAGCG TATGTATTTG CGCTGGGCAG AATCGCGTGG
TTTCAAAACT GAAATCATCG AAGAGTCGGA AGGTGAAGTG GCGGGTATTA AATCCGTGAC
GATCAAAATC TCCGGCGATT ACGCTTACGG CTGGCTGCGT ACAGAAACTG GCGTTCACCG
CCTGGTGCGT AAGAGCCCGT TTGACTCCGG CGGTCGTCGC CACACGTCGT TCAGCTCCGC
GTTTGTTTAT CCGGAAGTTG ATGATGATAT TGATATCGAA ATCAACCCGG CGGATTTGCG
CATTGACGTT TATCGCGCGT CCGGCGCGGG CGGTCAGCAC GTTAACCGTA CCGAATCTGC
GGTGCGTATT ACCCACATCC CGACCGGGAT CGTGACCCAG TGCCAGAACG ACCGTTCCCA
GCACAAGAAC AAAGACCAGG CCATGAAGCA GATGAAAGCG AAGCTTTATG AACTGGAGAT
GCAGAAGAAA AATGCTGAGA AACAGGCGAT GGAAGATAAC AAATCTGACA TCGGCTGGGG
CAGCCAGATT CGTTCTTATG TCCTTGATGA CTCCCGCATT AAAGATCTGC GCACCGGGGT
AGAAACCCGC AACACGCAGG CCGTGCTGGA CGGCAGCCTG GATCAATTTA TCGAAGCAAG
TTTGAAAGCA GGGTTATGA
 
Protein sequence
MFEINPVNNR IQDLTERSDV LRGYLCYDAK KERLEEVNAE LEQPDVWNEP ERAQALGKER 
SSLEAVVDTL DQMKQGLEDV SGLLELAVEA DDEETFNEAV AELDALEEKL AQLEFRRMFS
GEYDSADCYL DIQAGSGGTE AQDWASMLER MYLRWAESRG FKTEIIEESE GEVAGIKSVT
IKISGDYAYG WLRTETGVHR LVRKSPFDSG GRRHTSFSSA FVYPEVDDDI DIEINPADLR
IDVYRASGAG GQHVNRTESA VRITHIPTGI VTQCQNDRSQ HKNKDQAMKQ MKAKLYELEM
QKKNAEKQAM EDNKSDIGWG SQIRSYVLDD SRIKDLRTGV ETRNTQAVLD GSLDQFIEAS
LKAGL